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PRODUCTION OF HYDROXYLATED PATTY ACIDS 
IN GENETICALLY MODIFIED PLANTS 

CROSS-REFERENCE TO RELATED APPLICATION 

This application is a continuation-in-part of U.S. patent 
application Serial No. 08/320,982, filed October 11, 1994, 
which itself is a continuation-in-part of U.S. patent 
application Serial No. 08/314,596, filed September 26, 1994, 
now abandoned. The entire contents of U.S. patent application 
Serial No. 08/320,982 and U.S. patent application Serial No. 
08/314,596 are hereby incorporated by reference and relied 
upon. 

GOVERNMENT RIGHTS 

The invention described herein was made in the course of 
work under grant number DE-FG02-94ER20133 from the U.S. 
Department of Energy and grant No. MCB93052 69 from the 
National Science Foundation. Therefore, the U.S. Government 
has certain rights under this invention. 

TECHNICAL FIELD 

The present invention concerns the identification of 
nucleic acid sequences and constructs, and methods related 
thereto, and the use of these sequences and constructs to 
produce genetically modified plants for the purpose of 
altering the fatty acid composition of plant oils, waxes and 
related compounds. 

DEFINITIONS 

The subject of this invention is a class of enzymes that 



introduce a hydroxyl group into several different fatty acids 
resulting in the production of several different kinds of 
hydroxylated fatty acids. In particular, these enzymes 
catalyze hydroxy lat ion of oleic acid to 12 -hydroxy oleic acid 
5 and icosenoic acid to 14-hydroxy icosenoic acid. Other fatty 

acids such as palmitoleic and erucic acids may also be 
substrates. Since it is not possible to refer to the enzyme 
by reference to a unique substrate or product, we refer to the 
enzyme throughout as kappa hydroxylase to indicate that the 

10 enzyme introduces the hydroxyl three carbons distal (i.e., 

away from the carboxyl carbon of the acyl chain) from a double 
bond located near the center of the acyl chain. 
;^ The following fatty acids are also the subject of this 

L : invention: ricinoleic acid, l2-hydroxyoctadec-cis-9-enoic acid 

15^; (120H-18:1 ); lesquerolic acid, 14-hydroxy-cis-ll-icosenoic 
acid (14OH-20: l c,sA11 ) ; densipolic acid, 12-hydroxyoctadec-cis- 
9,15-dienoic acid (120H-18:2 ' ); auricolic acid, 14- 
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hydroxy-cis-11, 17-icosadienoic acid (14OH-20:2 ' ); 
hydroxyerucic , l6-hydroxydocos-cis-13-enoic acid (160H- 
20 22 : l c1sA13 ) ; hydroxypalmitoleic, l2-hydroxyhexadec-cis-9-enoic 

(120H-16: l clsAS> ) ; icosenoic acid (20 : l c,sA11 ) . It will be noted 
that icosenoic acid is spelled eicosenoic acid in some 
countries . 

25 BACKGROUND 

Extensive surveys of the fatty acid composition of seed 
oils from different species of higher plants have resulted in 
the identification of at least 3 3 structurally distinct 
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monohydroxylated plant fatty acids, and 12 different 
polyhydroxylated fatty acids that are accumulated by one or 
more plant species (reviewed by van de Loo et al. 1993) . 
Ricinoleic acid, the principal constituent of the seed oil 
from the castor plant Ricinus communis (L.) , is of commercial 
importance. We have previously described the cloning of a 
gene from this species that encodes a fatty acid hydroxylase, 
and the use of this gene to produce ricinoleic acid in 
transgenic plants of other species (see U.S. patent 
application Serial No. 08/320,982, filed October 11, 1994). 
The scientific evidence supporting the claims in that patent 
application were subsequently published (van de Loo et al., 
1995) . The use of the castor hydroxylase gene to also produce 
other hydroxy lated fatty acids such as lesquerolic acid, 
densipolic acid, hydroxypalmitoleic, hydroxyerucic and 
auricolic acid in transgenic plants is the subject of this 
invention. In addition, the identification of a gene encoding 
a homologous hydroxylase from Lesquerella fendleri, and the 
use of this gene to produce these hydroxylated fatty acids in 
transgenic plants is the subject of this invention. 

Castor is a minor oilseed crop. Approximately 50% of the 
seed weight is oil (triacylglycerol) in which 85-90% of total 
fatty acids are the hydroxylated fatty acid, ricinoleic acid. 
Oil pressed or extracted from castor seeds has many industrial 
uses based upon the properties endowed by the hydroxylated 
fatty acid. The most important uses are production of paints 
and varnishes, nylon-type synthetic polymers, resins, 
lubricants, and cosmetics (Atsmon 1989) . In addition to oil, 



the castor seed contains the extremely toxic protein ricin, 
allergenic proteins, and the alkaloid ricinine. These 
constituents preclude the use of the untreated seed meal 
(following oil extraction) as a livestock feed, normally an 
important economic aspect of oilseed utilization. 
Furthermore, with the variable nature of castor plants and a 
lack of investment in breeding, castor has few favorable 
agronomic characteristics. For a combination of these 
reasons, castor is no longer grown in the United States and 
the development of an alternative domestic source of 
hydroxylated fatty acids would be attractive. The production 
of ricinoleic acid, the important constituent of castor oil, 
in an established oilseed crop through genetic engineering 
would be a particularly effective means of creating a domestic 
source . 

Because there is no practical source of lesquerolic, 
densipolic and auricolic acids from plants that are adapted to 
modern agricultural practices, there is currently no large- 
scale use of these fatty acids by industry. However, the 
fatty acids would have uses similar to those of ricinoleic 
acid if they could be produced in large quantities at 
comparable cost to other plant-derived fatty acids (Smith 
1985) . Plant species, such as certain species in the genus 
Lesquerella, that accumulate a high proportion of these fatty 
acids, have not been domesticated and are not currently 
considered a practical source of fatty acids (Hirsinger, 
1989) . This invention represents a useful step toward the 
eventual production of these and other hydroxylated fatty 




acids in transgenic plants of agricultural importance. 

The taxonomic relationships between plants having similar 
or identical kinds of unusual fatty acids have been examined 
(van de Loo et al., 1993). In some cases, particular fatty 
5 acids occur mostly or solely in related taxa. In other cases 

there does not appear to be a direct link between taxonomic 
relationships and the occurrence of unusual fatty acids. In 
this respect, ricinoleic acid has how been identified in 12 
genera from 10 families (reviewed in van de Loo et al., 1993). 

10 ^ Thus, it appears that the ability to synthesize hydroxylated 

fatty acids has evolved several times independently during the 
u. radiation of the angiosperms. This suggested to us that the 

enzymes which introduce hydroxyl groups into fatty acids arose 
by minor modifications of a related enzyme. Indeed, as shown 

15 H 1 herein, the sequence similarity between Al2 fatty acid 

^ desaturases and the kappa hydroxylase from castor is so high 
H that it is not possible to unambiguously determine whether a 
particular enzyme is a desaturase or a hydroxylase on the 
basis of evidence in the scientific literature. Similarly, a 

20 patent application (PCT/US93/09987) that purports to teach the 

isolation and use of Al2 fatty acid desaturases does not teach 
how to distinguish a hydroxylase from a desaturase. In view 
of the importance of being able to distinguish between these 
activities for the purpose of genetic engineering of plant 

25 oils, the utility of that application is limited to the 

several instances where direct experimental evidence (e.g., 
altered fatty acid composition in transgenic plants) was 
presented to support the assignment of function. A method for 
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distinguishing between fatty acid desaturases and fatty acid 
hydroxylases on the basis of amino acid sequence of the enzyme 
is also a subject of this invention. 

A feature of hydroxylated or other unusual fatty acids is 
5 that they are generally confined to seed triacylglycerols, 

being largely excluded from the polar lipids by unknown 
mechanisms (Battey and Ohlrogge 1989; Prasad et al., 1987). 
This is particularly intriguing since diacylglycerol is a 
precursor of both triacylglycerol and polar lipid. With 
10=y castor microsomes, there is some evidence that the pool of 
mi ricinoleoyl-containing polar lipid is minimized by a 

preference of diacylglycerol acyltransf erase for ricinoleate- 
m containing diacylglycerols (Bafor et al. 1991) . Analyses of 
Q vegetative tissues have generated few reports of unusual fatty 
15 hi acids, other than those occurring in the cuticle. The cuticle 
i=i contains various hydroxylated fatty acids which are 

interesterif ied to produce a high molecular weight polyester 
which serves a structural role. A small number of other 
exceptions exist in which unusual fatty acids are found in 
20 tissues other than the seed. 

The biosynthesis of ricinoleic acid from oleic acid in 
the developing endosperm of castor (Ricinus communis) has been 
studied by a variety of methods. Morris (1967) established in 
double-labeling studies that hydroxylation occurs directly by 
25 hydroxyl substitution rather than via an unsaturated-, keto- 

or epoxy-intermediate . Hydroxylation using oleoyl-CoA as 
precursor can be demonstrated in crude preparations or 
microsomes, but activity in microsomes is unstable and 
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variable, and isolation of the microsomes involved a 
considerable, or sometimes complete loss of activity (Galliard 
and Stumpf , 1966; Moreau and Stumpf , 1981) . Oleic acid can 
replace oleoyl-CoA as a precursor, but only in the presence of 
CoA, Mg 2+ and ATP (Galliard and Stumpf, 1966) indicating that 
activation to the acyl-CoA is necessary. However, no 
radioactivity could be detected in ricinoleoyl-CoA (Moreau and 
Stumpf, 1981) . These and more recent observations (Bafor et 
al., 1991) have been interpreted as evidence that the 
substrate for the castor oleate hydroxylase is oleic acid 
esterified to phosphatidylcholine or another phospholipid. 

The hydroxylase is sensitive to cyanide and azide, and 
dialysis against metal chelators reduces activity, which could 
be restored by addition of FeSO A , suggesting iron involvement 
in enzyme activity (Galliard and Stumpf, 1966) . Ricinoleic 
acid synthesis requires molecular oxygen (Galliard and Stumpf, 
1966; Moreau and Stumpf 1981) and requires NAD (P) H to reduce 
cytochrome b5 which is thought to be the intermediate electron 
donor for the hydroxylase reaction (Smith et al., 1992). 
Carbon monoxide does not inhibit hydroxylation, indicating 
that a cytochrome P450 is not involved (Galliard and Stumpf, 
1966; Moreau and Stumpf 1981). Data from a study of the 
substrate specificity of the hydroxylase show that all 
substrate parameters (i.e., chain length and double bond 
position with respect to both ends) are important; deviations 
in these parameters caused reduced activity relative to oleic 
acid (Howling et al., 1972). The position at which the 
hydroxy 1 was introduced, however, was determined by the 



position of the double bond, always being three carbons 
distal. Thus, the castor acyl hydroxylase enzyme can produce 
a family of different hydroxylated fatty acids depending on 
the availability of substrates. Thus, as a matter of 
convenience, we refer to the enzyme throughout as a kappa 
hydroxylase (rather than an oleate hydroxylase) to indicate 
the broad substrate specificity. 

The castor kappa hydroxylase has many superficial 
similarities to the microsomal fatty acyl desaturases (Browse 
and Somerville, 1991). In particular, plants have a 
microsomal oleate desaturase active at the Al2 position. The 
substrate of this enzyme (Schmidt et al., 1993) and of the 
hydroxylase (Bafor et al., 1991) appears to be a fatty acid 
esterified to the sn-2 position of phosphatidylcholine. When 
oleate is the substrate, the modification occurs at the same 
position (A 12) in the carbon chain, and requires the same 
cof actors, namely electrons from NADH via cytochrome b 5 and 
molecular oxygen. Neither enzyme is inhibited by carbon 
monoxide (Moreau and Stumpf, 1981), the characteristic 
inhibitor of cytochrome P450 enzymes. 

There do not appear to have been any published 
biochemical studies of the properties of the hydroxylase 
enzyme (s) in Lesquerella. 

Conceptual basis of the invention 

In U.S. patent application No. 08/320,982, we described 
the use of a cDNA clone from castor for the production of 
ricinoleic acid in transgenic plants. As noted above, 



biochemical studies by others had suggested that the castor 
hydroxylase may not have strict specificity for oleic acid but 
would also catalyze hydroxylation of other fatty acids such as 
icosenoic acid (20:l c,sA11 ) (Howling et al., 1972). Based on 
these studies, our previous application No. 08/320,982 noted 
in Example 2 that the expression of the castor hydroxylase in 
transgenic plants of species such as Brassica napus and 
Arabidopsis thaliana that accumulate fatty acids such as 
icosenoic acid (20:l c1sA11 ) and erucic acid (13-docosenoic acid; 
22:l c,sA13 ) would be expected to accumulate some of the 
hydroxylated derivatives of these fatty acids due to the 
activity of the hydroxylase on these fatty acids. We have now 
obtained additional direct evidence for such a claim based on 
the production of ricinoleic, lesquerolic, densipolic and 
auricolic fatty acids in transgenic Arabidopsis plants and 
have included such evidence herein as Example 1. 

In example three of the previous application, we taught 
the various methods by which the castor hydroxylase clone and 
sequences derived thereof could be used to identify other 
hydroxylase clones from plant species such as Lesquerella 
fendleri that are known to accumulate hydroxylated fatty acids 
in seed oils. In this continuation we have provided an 
example of the use of that aspect of the invention for the 
isolation of a novel hydroxylase gene from Lesgxzerella 
fendleri . 

In view of the high degree of sequence similarity between 
A 12 fatty acid desaturases and the castor hydroxylase (van de 
Loo et al. , 1995) , the validity of claims for the use of 



desaturase or hydroxylase genes or sequences derived therefrom 
for the identification of genes of identical function from 
other species must be viewed with skepticism. In this 
application, we teach a method by which hydroxylase genes can 
be distinguished from desaturases and describe methods by 
which A 12 desaturases can be converted to hydroxylases by the 
modification of the gene encoding the desaturases. A 
mechanistic basis for the similar reaction mechanisms of 
desaturases and hydroxylases was presented in the earlier 
patent application (No. 08/320,982). Briefly, the available 
evidence suggests that fatty acid desaturases have a similar 
reaction mechanism to the bacterial enzyme methane 
monooxygenase which catalyses a reaction involving oxygen-atom 
transfer (CH 4 -* CH 3 0H) (van de Loo et al. # 1993). The cof actor 
in the hydroxylase component of methane monooxygenase is 
termed a jn-oxo bridged diiron cluster (FeOFe) . The two iron 
atoms of the FeOFe cluster are liganded by protein-derived 
nitrogen or oxygen atoms, and are tightly redox-coupled by the 
covalently-bridging oxygen atom. The FeOFe cluster accepts 
two electrons, reducing it to the diferrous state, before 
oxygen binding. Upon oxygen binding, it is likely that 
heterolytic cleavage also occurs, leading to a high valent 
oxoiron reactive species that is stabilized by resonance 
rearrangements possible within the tightly coupled FeOFe 
cluster. The stabilized high-valent oxoiron state of methane 
monooxygenase is capable of proton extraction from methane, 
followed by oxygen transfer, giving methanol. The FeOFe 
cofactor has been shown to be directly relevant to plant fatty 




acid modifications by the demonstration that castor stearoyl- 
ACP desaturase contains this type of cof actor (Fox et al. , 
1993) . 

On the basis of the foregoing considerations, we 
5 hypothesized that the castor oleate hydroxylase is a 

structurally modified fatty acyl desaturase, based upon three 
arguments. The first argument involves the taxonomic 
distribution of plants containing ricinoleic acid. Ricinoleic 
acid has been found in 12 genera of 10 families of higher 
10 ";9 plants (reviewed in van de Loo et al., 1993). Thus, plants in 
which ricinoleic acid occurs are found throughout the plant 
H kingdom, yet close relatives of these plants do not contain 
ffi the unusual fatty acid. This pattern suggests that the 
O ability to synthesize ricinoleic acid has arisen (and been 
15 slS lost) several times independently, and is therefore a quite 
O recent divergence. In other words, the ability to synthesize 
ricinoleic acid has evolved rapidly, suggesting that a 
relatively minor genetic change in the structure of the 
ancestral enzyme was necessary to accomplish it. 
20 The second argument is that many biochemical properties 

of castor kappa hydroxylase are similar to those of the 
microsomal desaturases, as discussed above (e.g., both 
preferentially act on fatty acids esterified to the sn-2 
position of phosphatidylcholine, both use cytochrome b5 as an 
25 intermediate electron donor, both are inhibited by cyanide, 

both require molecular oxygen as a substrate, both are thought 
to be located in the endoplasmic reticulum) . 

The third argument stems from the discussion of oxygenase 
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cof actors above, in which it is suggested that the plant 
membrane bound fatty acid desaturases may have a /j-oxo bridged 
diiron cluster-type cof actor, and that such cof actors are 
capable of catalyzing both fatty acid desaturations and 
hydroxylations, depending upon the electronic and structural 
properties of the protein active site. 

Taking these three arguments together, it was 
hypothesized that kappa hydroxylase of castor endosperm is 
homologous to the microsomal oleate A 12 desaturase found in 
all plants. The evidence supporting this hypothesis was 
disclosed in the previous patent application (No. 08/320,982). 
A number of genes encoding microsomal A 12 desaturases from 
various species have recently been cloned (Okuley et al., 
1994) and substantial information about the structure of these 
enzymes is now known. Hence, in the following invention we 
teach how to use structural information about fatty acyl 
desaturases to isolate kappa hydroxylase genes of this 
invention. This example teaches the method by which any 
carbon-monoxide insensitive plant fatty acyl hydroxylase gene 
can be identified by one skilled in the art. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-D show the mass spectra of hydroxy fatty acids 
standards (Figure 1A, o-TMS-methylricinoleate; Figure IB, O- 
TMS-methyl densipoleate; Figure 1C, O-TMS-methyl- 
lesqueroleate; and Figure ID, O-TMS-methylauricoleate) 

Figure 2 shows the fragmentation pattern of 



trimethylsilylated methyl esters of hydroxy fatty acids. 

Figure 3A shows the gas chromatogram of fatty acids 
extracted from seeds of wild type Arabidopsis plants. Figure 
3B shows the gas chromatogram of fatty acids extracted from 
seeds of transgenic Arabidopsis plants containing the fahl2 
hydroxylase gene. The numbers indicate the following fatty 

cisA9 12 

acids: [1] 16:0; [2] 18:0; [3] 18:lcisA9; [4] 18:2 ' ; [5] 
20:0; [6] 20:l cis * 11 ; [7] 18: 3 cis ^ 12 ' 15 ; [8]22:l cis * 13 ; [9] 24:l cis * 13 ; 
[10]ricinoleic acid; [11] densipolic acid; [12] lesquerolic 
acid; [13] auricolic acid. 

Figures 4A-D show the mass spectra of novel fatty acids 
found in seeds of transgenic plants. Figure 4A shows the mass 
spectrum of peak 10 from Figure 3B. Figure 4B shows the mass 
spectrum of peak 11 from Figure 3B. Figure 4C shows the mass 
spectrum of peak 12 from Figure 3B. Figure 4D shows the mass 
spectrum of peak 13 from Figure 3B. 

Figure 5 shows the nucleotide sequence of pLesq2 (SEQ ID 

NO: 1) . 

Figure 6 shows the nucleotide sequence of pLesq3 (SEQ ID 
NO : 2 ) . 

Figure 7 shows a Northern blot of total RNA from seeds of 
L. fendleri probed with pLesq2 or pLesq3. S, indicates RNA is 
from seeds; L, indicates RNA is from leaves. 

Figures 8A-B show the nucleotide sequence of genomic 
clone encoding pLesq-HYD (SEQ ID NO: 3), and the deduced amino 
acid sequence of hydroxylase enzyme encoded by the gene (SEQ 
ID NO: 4) . 

Figures 9A-B show multiple sequence alignment of deduced 



amino acid sequences for kappa hydroxylases and microsomal Al2 
desaturases. Abbreviations are: Rcfahl2, fahl2 hydroxylase 
gene from R. communis (van de Loo et al . , 1995); Lffahl2, 
kappa hydroxylase gene from L. fendleri; Atfad2, fad2 
desaturase from Arabidopsis thaliana (Okuley et al,, 1994); 
Gmfad2-1, fad2 desaturase from Glycine max (GenBank accession 
number L43920) ; Gmf ad2-2 , fad2 desaturase from Glycine max 
(Genbank accession number L43921) ; Zmfad2, fad2 desaturase 
from Zea mays (PCT/US93/09987) ; Rcfad2, fragment of fad2 
desaturase from R. communis (PCT/US93/09987) ; Bnfad2, fad2 
desaturase from Brassica napus (PCT/US93/09987) ; LFFAH12 .AMI, 
SEQ ID NO: 4; FAH12.AMI, SEQ ID NO: 5; ATFAD2 . AMI , SEQ ID NO: 6; 
BNFAD2 . AMI , SEQ ID NO: 7; GMFAD2 — 1 . AMI , SEQ ID NO: 8; GMFAD2- 
2. AMI, SEQ ID N0:9; ZMFAD2 .AMI , SEQ ID NO:10; and RCFAD2 . AMI , 
SEQ ID NO: 11. 

Figure 10 shows a Southern blot of genomic DNA from L. 
fendleri probed with pLesq-HYD. E=EcoRI, H = Hindlll, X = 
Xbal. 

Figure 11 shows a map of binary Ti plasmid pSLJ44024. 

SUMMARY OF THE INVENTION 

This invention relates to plant fatty acyl hydroxylases. 
Methods to use conserved amino acid or nucleotide sequences to 
obtain plant fatty acyl hydroxylases are described. Also 
described is the use of cDNA clones encoding a plant 
hydroxylase to produce a family of hydroxylated fatty acids in 
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transgenic plants. 

In a first embodiment, this invention is directed to 
recombinant DNA constructs which can provide for the 
transcription or transcription and translation (expression) of 
the plant kappa hydroxylase sequence. In particular, 
constructs which are capable of transcription or transcription 
and translation in plant host cells are preferred. Such 
constructs may contain a variety of regulatory regions 
including transcriptional initiation regions obtained from 
genes preferentially expressed in plant seed tissue. In a 
second aspect, this invention relates to the presence of such 
constructs in host cells, especially plant host cells which 
have an expressed plant kappa hydroxylase therein. 

In yet another aspect, this invention relates to a method 
for producing a plant kappa hydroxylase in a host cell or 
progeny thereof via the expression of a construct in the cell. 
Cells containing a plant kappa hydroxylase as a result of the 
production of the plant kappa hydroxylase encoding sequence 
are also contemplated herein. 

In another embodiment, this invention relates to methods 
of using a DNA sequence encoding a plant kappa hydroxylase for 
the modification of the proportion of hydroxylated fatty acids 
produced within a cell, especially plant cells. Plant cells 
having such a modified hydroxylated fatty acid composition are 
also contemplated herein. 

In a further aspect of this invention, plant kappa 
hydroxylase proteins and sequences which are related thereto, 
including amino acid and nucleic acid sequences, are 
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contemplated. Plant kappa hydroxylase exemplified herein 
includes a L&squerella. fendleri fatty acid hydroxylase- This 
exemplified fatty acid hydroxylase may be used to obtain other 
plant fatty acid hydroxylases of this invention. 

In a further aspect of this invention, a nucleic acid 
sequence which directs the seed specific expression of an 
associated polypeptide coding sequence is described. The use 
of this nucleic acid sequence or fragments derived thereof, to 
obtain seed-specific expression in higher plants of any coding 
sequence is contemplated herein. 

DETAILED DESCRIPTION OF THE INVENTION 

A genetically transformed plant of the present invention 
which accumulates hydroxylated fatty acids can be obtained by 
expressing the double -stranded DNA molecules described in this 
application. 

A plant fatty acid hydroxylase of this invention includes 
any sequence of amino acids, such as a protein, polypeptide or 
peptide fragment, or nucleic acid sequences encoding such 
polypeptides, obtainable from a plant source which 
demonstrates the ability to catalyze the production of 
ricinoleic, lesquerolic, hydroxyerucic (16-hydroxydocos-cis- 
13-enoic acid) or hydroxypalmitoleic ( 12-hydroxyhexadec-cis-9- 
enoic) from CoA, ACP or lipid-linked monoenoic fatty acid 
substrates under plant enzyme reactive conditions. By "enzyme 
reactive conditions" is meant that any necessary conditions 
are available in an environment (i.e. , such factors as 



temperature, pH, lack of inhibiting substances) which will 
permit the enzyme to function. 

Preferential activity of a plant fatty acid hydroxylase 
toward a particular fatty acyl substrate is determined upon 
comparison of hydroxylated fatty acid product amounts obtained 
per different fatty acyl substrates. For example, by "oleate 
preferring" is meant that the hydroxylase activity of the 
enzyme preparation demonstrates a preference for oleate- 
containing substrates over other substrates. Although the 
precise substrate of the castor fatty acid hydroxylase is not 
known, it is thought to be a monounsaturated fatty acid moiety 
which is esterified to a phospholipid such as 
phosphatidylcholine. However, it is also possible that 
monounsaturated fatty acids esterified to 

phosphatidylethanolamine, phosphatidic acid or a neutral lipid 
such as diacylglycerol or a Coenzyme-A thioester may also be 
substrates. As noted above, significant activity has been 
observed in radioactive labelling studies using fatty acyl 
substrates other than oleate (Howling et al. , 1972) indicating 
that the substrate specificity is for a family of related 
fatty acyl compounds. Because the castor hydroxylase 
introduces hydroxy groups three carbons from a double bond, 
proximal to the methyl carbon of the fatty acid we term the 
enzyme a kappa hydroxylase for convenience. Of particular 
interest, we envision that the castor kappa hydroxylase may be 
used for production of i2-hydroxy-9-octadecenoic acid 
(ricinoleate) , 12-hydroxy-9-hexadecenoic acid, 14-hydroxy-ll- 
eicosenoic acid, 16 -hydroxy- 1 3 -docos^moic acid, 9-hydroxy-6- 
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octadecenoic acid by expression in plants species which 
produce the non-hydroxylated precursors. We also envision 
production of additionally modified fatty acids such as 12- 
hydroxy-9 , 15-octadecadienoic acid that result from 
5 desaturation of hydroxylated fatty acids (e.g. , 12-hydroxy-9- 

octadecenoic acid in this example) . 

We also envision that future advances in the genetic 
engineering of plants will lead to production of substrate 
fatty acids, such as icosenoic acid esters, and palmitoleic 

10 acid esters in plants that do not normally accumulate such 

fatty acids. We envision that the invention described herein 
may be used in conjunction with such future improvements to 
produce hydroxylated fatty acids of this invention in any 
plant species that is amenable to directed genetic 

15 modification. Thus, the applicability of this invention is 

not limited in our conception only to those species that 
currently accumulate suitable substrates. 

As noted above, a plant kappa hydroxylase of this 
invention will display activity towards various fatty acyl 

20 substrates. During biosynthesis of lipids in a plant cell, 

fatty acids are typically covalently bound to acyl carrier 
protein (ACP) , coenzyme A (CoA) or various cellular lipids. 
Plant kappa hydroxylases which display preferential activity 
toward lipid-linked acyl substrate are especially preferred 

25 because they are likely to be closely associated with normal 

pathway of storage lipid synthesis in immature embryos. 
However, activity toward acyl-CoA substrates or other 
synthetic substrates, for example, is also contemplated 
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herein. 

Other plant kappa hydroxylases are obtainable from the 
specific exemplified sequences provided herein. Furthermore, 
it will be apparent that one can obtain natural and synthetic 
5 plant kappa hydroxylases including modified amino acid 

sequences and starting materials for synthetic-protein 
modeling from the exemplified plant kappa hydroxylase and from 
plant kappa hydroxylases which are obtained through the use of 
such exemplified sequences. Modified amino acid sequences 

10 1= include sequences which have been mutated, truncated, 

increased and the like, whether such sequences were partially 
or wholly synthesized. Sequences which are actually purified 
from plant preparations or are identical or encode identical 
proteins thereto, regardless of the method used to obtain the 

15 ;:; : protein or sequence, are equally considered naturally derived, 
fir Thus, one skilled in the art will readily recognize that 

H : antibody preparations, nucleic acid probes (DNA and RNA) and 
the like may be prepared and used to screen and recover 
"homologous" or "related" kappa hydroxylases from a variety of 

20 plant sources. Typically, nucleic acid probes are labeled to 

allow detection, preferably with radioactivity although 
enzymes or other methods may also be used. For immunological 
screening methods, antibody preparations either monoclonal or 
polyclonal are utilized. Polyclonal antibodies, although less 

25 specific, typically are more useful in gene isolation. For 

detection, the antibody is labeled using radioactivity or any 
one of a variety of second antibody/ enzyme conjugate systems 
that are commercially available. 
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Homologous sequences are found when there is an identity 
of sequence and may be determined upon comparison of sequence 
information, nucleic acid or amino acid, or through 
hybridization reactions between a known kappa hydroxylase and 
5 a candidate source. Conservative changes, such as Glu/Asp, 

Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered 
in determining sequence homology. Typically, a lengthy 
nucleic acid sequence may show as little as 50-60% sequence 
identity, and more preferably at least about 70% sequence 

10 " identity, between the target sequence and the given plant 

kappa hydroxylase of interest excluding any deletions which 
may be present, and still be considered related. Amino acid 
sequences are considered homologous by as little as 25% 
sequence identity between the two complete mature proteins. 

15 (See generally, Doolittle, R.F., OF URFS and ORFS, University 

H Science Books, CA, 1986.) 

jl A genomic or other appropriate library prepared from the 

candidate plant source of interest may be probed with 
conserved sequences from the plant kappa hydroxylase to 

20 identify homologously related sequences. Use of an entire 

cDNA or other sequence may be employed if shorter probe 
sequences are not identified. Positive clones are then 
analyzed by restriction enzyme digestion and/or sequencing. 
When a genomic library is used, one or more sequences may be 

25 identified providing both the coding region, as well as the 

transcriptional regulatory elements of the kappa hydroxylase 
gene from such plant source. Probes can also be considerably 
shorter than the entire sequence. Oligonucleotides may be 
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used, for example, but should be at least about 10, preferably 
at least about 15, more preferably at least 20 nucleotides in 
length. When shorter length regions are used for comparison, 
a higher degree of sequence identity is required than for 
longer sequences. Shorter probes are often particularly 
useful for polymerase chain reactions (PCR) , especially when 
highly conserved sequences can be identified (See Gould, et 
al., 1989 for examples of the use of PCR to isolate homologous 
genes from taxonomically diverse species) . 

When longer nucleic acid fragments are employed (>100 bp) 
as probes, especially when using complete or large cDNA 
sequences, one would screen with low stringencies (for 
example, 40-50°C below the melting temperature of the probe) 
in order to obtain signal from the target sample with 20-50% 
deviation, i.e., homologous sequences. (Beltz, et al. 1983). 

In a preferred embodiment, a plant kappa hydroxylase of 
this invention will have at least 60% overall amino acid 
sequence similarity with the exemplified plant kappa 
hydroxylase. In particular, kappa hydroxylases which are 
obtainable from an amino acid or nucleic acid sequence of a 
castor or lesquerella kappa hydroxylase are especially 
preferred. The plant kappa hydroxylases may have preferential 
activity toward longer or shorter chain fatty acyl substrates. 
Plant fatty acyl hydroxylases having oleate-12-hydroxylase 
activity and eicosenoate-14-hydroxylase activity are both 
considered homologously related proteins because of in vitro 
evidence (Howling et al., 1972), and evidence disclosed 
herein, that the castor kappa hydroxylase will act on both 



substrates. Hydroxy lated fatty acids may be subject to 
further enzymatic modification by other enzymes which are 
normally present or are introduced by genetic engineering 
methods. For example, 14-hydroxy-ll, 17-eicosadienoic acid, 
which is present in some Lesquerella species (Smith 1985) , is 
thought to be produced by desaturation of 14 -hydroxy- 11- 
eicosenoic acid. 

Again, not only can gene clones and materials derived 
thereof be used to identify homologous plant fatty acyl 
hydroxylases, but the resulting sequences obtained therefrom 
may also provide a further method to obtain plant fatty acyl 
hydroxylases from other plant sources. In particular, PCR may 
be a useful technique to obtain related plant fatty acyl 
hydroxylases from sequence data provided herein. One skilled 
in the art will be able to design oligonucleotide probes based 
upon sequence comparisons or regions of typically highly 
conserved sequence. Of special interest are polymerase chain 
reaction primers based on the conserved regions of amino acid 
sequence between the castor kappa hydroxylase and the L. 
fendleri hydroxylase (SED ID NO: 4). Details relating to the 
design and methods for a PCR reaction using these probes are 
described more fully in the examples. 

It should also be noted that the fatty acyl hydroxylases 
of a variety of sources can be used to investigate fatty acid 
hydroxylation events in a wide variety of plant and in vivo 
applications. Because all plants synthesize fatty acids via a 
common metabolic pathway, the study and/or application of one 
plant fatty acid hydroxylase to a heterologous plant host may 




be readily achieved in a variety of species. 

Once the nucleic acid sequence is obtained, the 
transcription, or transcription and translation (expression) , 
of the plant fatty acyl hydroxylases in a host cell is desired 
5 to produce a ready source of the enzyme and/or modify the 

composition of fatty acids found therein in the form of free 
fatty acids, esters (particularly esterified to glycerolipids 
or as components of wax esters) , estolides, or ethers- Other 
useful applications may be found when the host cell is a plant 
10 :f host cell, in vitro and in vivo. 

iff For example, by increasing the amount of an kappa 

hydroxylase available to the plant, an increased percentage of 
uj ricinoleate or lesqueroleate (14-hydroxy-ll-eicosenoic acid) 
^ may be provided. 
15^ 

M Kappa hydroxylase 

By this invention, a mechanism for the biosynthesis of 
ricinoleic acid in plants is demonstrated. Namely, that a 

20 specific plant kappa hydroxylase having preferential activity 

toward fatty acyl substrates is involved in the accumulation 
of hydroxylated fatty acids in at least some plant species. 
The use of the terms ricinoleate or ricinoleic acid (or 
lesqueroleate or lesquerolic acid, densipoleate etc.) is 

25 intended to include the free acids, the ACP and CoA esters, 

the salts of these acids, the glycerolipid esters 
(particularly the triacylglycerol esters) , the wax esters, the 
estolides and the ether derivatives of these acids. 
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The determination that plant fatty acyl hydroxylases are 
active in the in vivo production of hydroxylated fatty acids 
suggests several possibilities for plant enzyme sources. And 
in fact, hydroxylated fatty acids are found in some natural 
plant species in abundance. For example, three hydroxy fatty 
acids related to ricinoleate occur in major amounts in seed 
oils from various Lesquerella species. Of particular 
interest, lesquerolic acid is a 20 carbon homolog of 
ricinoleate with two additional carbons at the carboxyl end of 
the chain (Smith 1985) . Other natural plant sources of 
hydroxylated fatty acids include but are not limited to seeds 
of the Linum genus, seeds of Wright ia species, Lycopodium 
species, Strophanthus species, Convolvulaces species, 
Calendula species and many others (van de Loo et al. , 1993). 

Plants having significant presence of ricinoleate or 
lesqueroleate or desaturated other or modified derivatives of 
these fatty acids are preferred candidates to obtain 
naturally-derived kappa hydroxylases. For example, 
Lesquerella. densipila. contains a diunsaturated 18 carbon fatty 
acid with a hydroxyl group (van de Loo et al., 1993) that is 
thought to be produced by an enzyme that is closely related to 
the castor kappa hydroxylase, according to the theory on which 
this invention is based. In addition, a comparison between 
kappa hydroxylases and between plant fatty acyl hydroxylases 
which introduce hydroxyl groups at positions other than the 
12 -carbon of oleate or the 14 -carbon of lesqueroleate or on 
substrates other than oleic acid and icosenoic acid may yield 
insights for protein modeling or other modifications to create 



synthetic hydroxylases as discussed above. For example, on 
the basis of information gained from structural comparisons of 
the A 12 desaturases and the kappa hydroxylase, we envision 
making genetic modifications in the structural genes for Al2 
desaturases that convert these desaturases to kappa- 
hydroxylases . We also envision making changes in A 15 
hydroxylases that convert these to hydroxylases with 
comparable substrate specificity to the desaturases (e.g. , 
conversion of 18 :2^' 12 to 150H-18 : 2* 9 ' 12 . Since the difference 
between a hydroxylase and a desaturases concerns the 
disposition of one proton, we envision that by systematically 
changing the charged groups in the region of the enzyme near 
the active site, we can effect this change. 

Especially of interest are fatty acyl hydroxylases which 
demonstrate activity toward fatty acyl substrates other than 
oleate, or which introduce the hydroxyl group at a location 
other than the C12 carbon. As described above, other plant 
sources may also provide sources for these enzymes through the 
use of protein purification, nucleic acid probes, antibody 
preparations, protein modeling, or sequence comparisons, for 
example, and of special interest are the respective amino acid 
and nucleic acid sequences corresponding to such plant fatty 
acyl hydroxylases. Also as previously described, once a 
nucleic acid sequence is obtained for the given plant 
hydroxylase, further plant sequences may be compared and/ or 
probed to obtain homologously related DNA sequences thereto 
and so on . 




Genetic Engineering Applications 

As is well known in the art, once a cDNA clone encoding a 
plant kappa hydroxylase is obtained, it may be used to obtain 
5 its corresponding genomic nucleic acid sequences thereto. 

The nucleic acid sequences which encode plant kappa 
hydroxylases may be used in various constructs, for example, 
as probes to obtain further sequences from the same or other 
species. Alternatively, these sequences may be used in 
10'=: conjunction with appropriate regulatory sequences to increase 
levels of the respective hydroxylase of interest in a host 
cell for the production of hydroxylated fatty acids or study 
jrf of the enzyme in vitro or in vivo or to decrease or increase 
;= levels of the respective hydroxylase of interest for some 
15:^; applications when the host cell is a plant entity, including 
^ plant cells, plant parts (including but not limited to seeds, 
H cuttings or tissues) and plants. 

A nucleic acid sequence encoding a plant kappa 
hydroxylase of this invention may include genomic, cDNA or 
2 0 mRNA sequence. By "encoding" is meant that the sequence 

corresponds to a particular amino acid sequence either in a 
sense or anti-sense orientation. By "recombinant" is meant 
that the sequence contains a genetically engineered 
modification through manipulation via mutagenesis, restriction 
25 enzymes, and the like. A cDNA sequence may or may not encode 

pre-processing sequences, such as transit or signal peptide 
sequences. Transit or signal peptide sequences facilitate the 
delivery of the protein to a given organelle and are 
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frequently cleaved from the polypeptide upon entry into the 
organelle, releasing the "mature" sequence. The use of the 
precursor DNA sequence is preferred in plant cell expression 
cassettes. 

Furthermore, as discussed above the complete genomic 
sequence of the plant kappa hydroxylase may be obtained by the 
screening of a genomic library with a probe, such as a cDNA 
probe, and isolating those sequences which regulate expression 
in seed tissue. In this manner, the transcription and 
translation initiation regions, introns, and/or transcript 
termination regions of the plant kappa hydroxylase may be 
obtained for use in a variety of DNA constructs, with or 
without the kappa hydroxylase structural gene. Thus, nucleic 
acid sequences corresponding to the plant kappa hydroxylase of 
this invention may also provide signal sequences useful to 
direct transport into an organelle 5 ' upstream non-coding 
regulatory regions (promoters) having useful tissue and timing 
profiles, 3' downstream non-coding regulatory region useful as 
transcriptional and translational regulatory regions and may 
lend insight into other features of the gene. 

Once the desired plant kappa hydroxylase nucleic acid 
sequence is obtained, it may be manipulated in a variety of 
ways. Where the sequence involves non-coding flanking 
regions, the flanking regions may be subjected to resection, 
mutagenesis, etc. Thus, transitions, transversions , 
deletions, and insertions may be performed on the naturally 
occurring sequence. In addition, all or part of the sequence 
may be synthesized. In the structural gene, one or more 




codons may be modified to provide for a modified amino acid 
sequence, or one or more codon mutations may be introduced to 
provide for a convenient restriction site or other purpose 
involved with construction or expression. The structural gene 
5 may be further modified by employing synthetic adapters, 

linkers to introduce one or more convenient restriction sites, 
or the like. 

The nucleic acid or amino acid sequences encoding a plant 
kappa hydroxylase of this invention may be combined with other 
10 % non-native, or "heterologous", sequences in a variety of ways. 
K? By "heterologous" sequences is meant any sequence which is not 

naturally found joined to the plant kappa hydroxylase, 
;if including, for example, combination of nucleic acid sequences 
from the same plant which are not naturally found joined 
1 5 )f i together . 

The DNA sequence encoding a plant kappa hydroxylase of 
1^ this invention may be employed in conjunction with all or part 
of the gene sequences normally associated with the kappa 
hydroxylase. In its component parts, a DNA sequence encoding 
20 kappa hydroxylase is combined in a DNA construct having, in 

the 5* to 3' direction of transcription, a transcription 
initiation control region capable of promoting transcription 
and translation in a host cell, the DNA sequence encoding 
plant kappa hydroxylase and a transcription and translation 
25 termination region. 

Potential host cells include both prokaryotic and 
eukaryotic cells. A host cell may be unicellular or found in 
a multicellular differentiated or undifferentiated organism 
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depending upon the intended use. Cells of this invention may 
be distinguished by having a plant kappa hydroxylase foreign 
to the wild-type cell present therein, for example, by having 
a recombinant nucleic acid construct encoding a plant kappa 
hydroxylase therein. 

Depending upon the host, the regulatory regions will 
vary, including regions from viral, plasmid or chromosomal 
genes, or the like. For expression in prokaryotic or 
eukaryotic microorganisms, particularly unicellular hosts, a 
wide variety of constitutive or regulatable promoters may be 
employed. Expression in a microorganism can provide a ready 
source of the plant enzyme. Among transcriptional initiation 
regions which have been described are regions from bacterial 
and yeast hosts, such as E. coll, B. subtills, Saccharomyces 
cerevisiae, including genes such as beta-galactosidase, T7 
polymerase, tryptophan E and the like. 

For the most part, the constructs will involve regulatory 
regions functional in plants which provide for modified 
production of plant kappa hydroxylase with resulting 
modification of the fatty acid composition. The open reading 
frame, coding for the plant kappa hydroxylase or functional 
fragment thereof will be joined at its 5» end to a 
transcription initiation regulatory region such as the wild- 
type sequence naturally found 5' upstream to the kappa 
hydroxylase structural gene. Numerous other transcription 
initiation regions are available which provide for a wide 
variety of constitutive or regulatable, e.g., inducible, 
transcription of the structural gene functions. Among 



transcriptional initiation regions used for plants are such 
regions associated with the structural genes such as for 
nopaline and mannopine synthases, or with napin, soybean 0- 
conglycinin, oleosin, 12S storage protein, the cauliflower 
mosaic virus 3 5S promoters and the like. The transcription/ 
translation initiation regions corresponding to such 
structural genes are found immediately 5 1 upstream to the 
respective start codons. In embodiments wherein the 
expression of the kappa hydroxylase protein is desired in a 
plant host, the use of all or part of the complete plant kappa 
hydroxylase gene is desired; namely all or part of the 5 1 
upstream non-coding regions (promoter) together with the 
structural gene sequence and 3 1 downstream non-coding regions 
may be employed. If a different promoter is desired, such as 
a promoter native to the plant host of interest or a modified 
promoter, i.e., having transcription initiation regions 
derived from one gene source and translation initiation 
regions derived from a different gene source, including the 
sequence encoding the plant kappa hydroxylase of interest, or 
enhanced promoters, such as double 35S CaMV promoters, the 
sequences may be joined together using standard techniques. 

For such applications when 5' upstream non-coding regions 
are obtained from other genes regulated during seed 
maturation, those preferentially expressed in plant embryo 
tissue, such as transcription initiation control regions from 
the B . napus napin gene, or the Arabidopsis 12 S storage 
protein, or soybean jS-conglycinin (Bray et al., 1987), or the 
L . fendleri kappa hydroxylase promoter described herein are 



desired. Transcription initiation regions which are 
preferentially expressed in seed tissue, i.e., which are 
undetectable in other plant parts, are considered desirable 
for fatty acid modifications in order to minimize any 
disruptive or adverse effects of the gene product. 

Regulatory transcript termination regions may be provided 
in DNA constructs of this invention as well. Transcript 
termination regions may be provided by the DNA sequence 
encoding the plant kappa hydroxylase or a convenient 
transcription termination region derived from a different gene 
source, for example, the transcript termination region which 
is naturally associated with the transcript initiation region. 
Where the transcript termination region is from a different 
gene source, it will contain at least about 0.5 kb, preferably 
about 1-3 kb of sequence 3 • to the structural gene from which 
the termination region is derived. 

Plant expression or transcription constructs having a 
plant kappa hydroxylase as the DNA sequence of interest for 
increased or decreased expression thereof may be employed with 
a wide variety of plant life, particularly, plant life 
involved in the production of vegetable oils for edible and 
industrial uses. Most especially preferred are temperate 
oilseed crops. Plants of interest include, but are not 
limited to rapeseed (Canola and high erucic acid varieties) , 
Crambe, Brassiest juncea, Brassica nigra, meadowfoam, flax, 
sunflower, safflower, cotton, Cuphea, soybean, peanut, coconut 
and oil palms and corn. An important criterion in the 
selection of suitable plants for the introduction on the kappa 



hydroxylase is the presence in the host plant of a suitable 
substrate for the hydroxylase. Thus, for example, production 
of ricinoleic acid will be best accomplished in plants that 
normally have high levels of oleic acid in seed lipids. 
Similarly, production of lesquerolic acid will best be 
accomplished in plants that have high levels of icosenoic acid 

in seed lipids. 

Depending on the method for introducing the recombinant 
constructs into the host cell, other DNA sequences may be 
required. Importantly, this invention is applicable to 
dicotyledons and monocotyledons species alike and will be 
readily applicable to new and/or improved transformation and 
regulation techniques. The method of transformation is not 
critical to the current invention; various methods of plant 
transformation are currently available. As newer methods are 
available to transform crops, they may be directly applied 
hereunder. For example, many plant species naturally 
susceptible to Agrobacterium infection may be successfully 
transformed via tripartite or binary vector methods of 
Agrobacterium mediated transformation. In addition, 
techniques of microinjection, DNA particle bombardment, 
electroporation have been developed which allow for the 
transformation of various monocot and dicot plant species. 

In developing the DNA construct, the various components 
of the construct or fragments thereof will normally be 
inserted into a convenient cloning vector which is capable of 
replication in a bacterial host, e.g., E. coll. Numerous 
vectors exist that have been described in the literature. 




After each cloning, the plasmid may be isolated and subjected 
to further manipulation, such as restriction, insertion of new 
fragments, ligation, deletion, insertion, resection, etc., so 
as to tailor the components of the desired sequence. Once the 
5 construct has been completed, it may then be transferred to an 

appropriate vector for further manipulation in accordance with 
the manner of transformation of the host cell. 

Normally, included with the DNA construct will be a 
structural gene having the necessary regulatory regions for 

10 r expression in a host and providing for selection of 

transformant cells. The gene may provide for resistance to a 
cytotoxic agent, e.g., antibiotic, heavy metal, toxin, etc., 
complementation providing prototropy to an auxotrophic host, 
viral immunity or the like. Depending upon the number of 

15 =- different host species the expression construct or components 
thereof are introduced, one or more markers may be employed, 
where different conditions for selection are used for the 
different hosts. 

It is noted that the degeneracy of the DNA code provides 

20 that some codon substitutions are permissible of DNA sequences 

without any corresponding modification of the amino acid 
sequence . 

As mentioned above, the manner in which the DNA construct 
is introduced into the plant host is not critical to this 
25 invention. Any method which provides for efficient 

transformation may be employed. Various methods for plant 
cell transformation include the use of Ti- or Ri-plasmids, 
microinjection, electroporation, infiltration, imbibition, DNA 
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particle bombardment, liposome fusion, DNA bombardment or the 
like. In many instances, it will be desirable to have the 
construct bordered on one or both sides of the T— DNA , 
particularly having the left and right borders, more 
particularly the right border. This is particularly useful 
when the construct uses A. tumefaclens or A. rhizogenes as a 
mode for transformation, although the T-DNA borders may find 
use with other modes of transformation. 

Where Agrobacterium is used for plant cell 
transformation, a vector may be used which may be introduced 
into the Agrobacterium host for homologous recombination with 
T-DNA or the Ti- or Ri-plasmid present in the Agrobacterium 
host. The Ti- or Ri-plasmid containing the T-DNA for 
recombination may be armed (capable of causing gall formation) 
or disarmed (incapable of causing gall), the latter being 
permissible, so long as the vir genes are present in the 
transformed Agrobacterium host. The armed plasmid can give a 
mixture of normal plant cells and gall. 

In some instances where Agrobacterium is used as the 
vehicle for transforming plant cells, the expression construct 
bordered by the T-DNA border (s) will be inserted into a broad 
host spectrum vector, there being broad host spectrum vectors 
described in the literature. Commonly used is pRK2 or 
derivatives thereof. See, for example, Ditta et al., (1980), 
which is incorporated herein by reference. Included with the 
expression construct and the T-DNA will be one or more 
markers, which allow for selection of transformed 
Agrobacterium and transformed plant cells. A number of 



markers have * been* developed for use with plant cells, such as 
resistance to kanamycin, the aminoglycoside G418, hygromycin, 
or the like. The particular marker employed is not essential 
to this invention, one or another marker being preferred 
depending on the particular host and the manner of 
construction . 

For transformation of plant cells using Agrobacterlum , 
explants may be combined and incubated with the transformed 
Agrobacterium for sufficient time for transformation, the 
bacteria killed, and the plant cells cultured in an 
appropriate selective medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate plant 
hormones in accordance with known methods and the shoots 
transferred to rooting medium for regeneration of plants. The 
plants may then be grown to seed and the seed used to 
establish repetitive generations and for isolation of 
vegetable oils. 

The invention now being generally described, it will be 
more readily understood by reference to the following examples 
which are included for purposes of illustration only and are 
not intended to limit the present invention. 

EXAMPLES 

In the experimental disclosure which follows, all 
temperatures are given in degrees centigrade ( ° ) , weights are 
given in grams (g) , milligram (mg) or micrograms (/*g) , 
concentrations are given as molar (M) , millimolar (mM) or 
micromolar (fM) and all volumes are given in liters (1) , 



microliters (//l) or milliliters (ml) , unless otherwise 
indicated. 



EXAMPLE 1 - PRODUCTION OF NOVEL HYDROXYLATED FATTY ACIDS IN 
ARABIDOPSIS THALIANA 
Over view 

The kappa hydroxylase encoded by the previously described 
fahl2 gene from Castor (U.S. Patent application 08/320,982) 
was used to produce ricinoleic acid, lesquerolic acid, 
densipolic acid and auricolic acid in transgenic Arabidopsis 
plants. This example reduces to practice the method taught in 
Example 2 of the foregoing application. 

Production of transgenic plants 

A variety of methods have been developed to insert a DNA 
sequence of interest into the genome of a plant host to obtain 
the transcription and translation of the sequence to effect 
phenotypic changes. The following methods represent only one 
of many equivalent means of producing transgenic plants and 
causing expression of the hydroxylase gene. 

Arabidopsis plants were transformed, by Agrobacterlum- 
mediated transformation, with the kappa hydroxylase encoded by 
the Castor fahl2 gene on binary Ti plasmid pB6. This plasmid 
was previously used to transform Nicotiana tabacum for the 

production of ricinoleic acid (U.S. Patent application 
08/320,982) . 

Inoculums of Agrobacterlum tumefaciens strain GV3101 



containing binary Ti plasmid pB6 were plated on L-broth plates 
containing 50 /ig/ml kanamycin and incubated for 2 days at 
30 °C. Single colonies were used to inoculate large liquid 
cultures (L-broth medium with 50 mg/1 rifampicin, 110 mg/1 
gentamycin and 200 mg/1 kanamycin) to be used for the 
transformation of Arabidopsis plants. 

Arabidopsis plants were transformed by the in planta. 
transformation procedure essentially as described by Bechtold 
et al., (1993). Cells of A. tumefaciens GV3101(pB6) were 
harvested from liquid cultures by centrifugation, then 
resuspended in infiltration medium at OD 600 =0.8 (Infiltration 
medium was Murashige and Skoog macro and micronutrient medium 
(Sigma Chemical Co., St. Louis, MO) containing 10 mg/1 
6-benzylaminopurine and 5% glucose) . Batches of 12-15 plants 
were grown for 3 to 4 weeks in natural light at a mean daily 
temperature of approximately 25°C in 3.5 inch pots containing 
soil. The intact plants were immersed in the bacterial 
suspension then transferred to a vacuum chamber and placed 
under 600 mm of vacuum produced by a laboratory vacuum pump 
until tissues appeared uniformly water-soaked (approximately 
10 min) . The plants were grown at 25 °C under continuous light 
(100 fimol m' 2 s" 1 irradiation in the 400 to 700 nm range) for 
four weeks. The seeds obtained from all the plants in a pot 
were harvested as one batch. The seeds were sterilized by 
sequential treatment for 2 min with ethanol followed by 10 min 
in a mixture of household bleach (Chlorox) , water and Tween-8 0 
(50%, 50%, 0.05%) then rinsed thoroughly with sterile water. 
The seeds were plated at high density (2000 to 4000 per plate) 




onto agar-solidif ied medium in 100 mm petri plates containing 
1/2 X Murashige and Skoog salts medium enriched with B5 
vitamins (Sigma Chemical Co. , St. Louis, MO) and containing 
kanamycin at 50 mg/1. After incubation for 48 h at 4°C to 
5 stimulate germination, seedlings were grown for a period of 

seven days until transf ormants were clearly identifiable as 
healthy green seedlings against a background of chlorotic 
kanamycin-sensitive seedlings. The transf ormants were 
transferred to soil for two weeks before leaf tissue could be 
10 j=| used for DNA and lipid analysis. More than 20 transf ormants 
;5 were obtained. 

^ DNA was extracted from young leaves from transformants to 

;I verify the presence of an intact fahl2 gene. The presence of 
the transgene in a number of the putative transgenic lines was 
15 ]!f verified by using the polymerase chain reaction to amplify the 

\ u insert from pB6. The primers used were HF2 = 

O GCTCTTTTGTGCGCTCATTC (SEQ ID NO: 12) and HR1 = 

CGGTACCAGAAAACGCCTTG (SEQ ID NO: 13), which were designed to 
allow the amplification of a 700 bp fragment. Approximately 
20 100 ng of genomic DNA was added to a solution containing 25 

pmol of each primer, 1.5 U Taq polymerase (Boehringer 
Manheim) , 200 uM of dNTPs, 50 mM KCl, 10 mM Tris.Cl (pH 9), 
0.1% (v/v) Triton X-100, 1.5 mM MgCl 2 , 3% (v/v) formamide, to a 
final volume of 50 fil. Amplifications conditions were: 4 min 
25 denaturation step at 94°C, followed by 30 cycles of 92 °C for 1 

min, 55 °C for 1 min, 72 °C for 2 min. A final extension step 
closed the program at 72 °C for 5 min. Transformants could be 
positively identified after visualization of a characteristic 



- 38 - 




* 

1 kb amplified fragment on an ethidium bromide stained agarose 
gel. All transgenic lines tested gave a PCR product of a size 
consistent with the expected genotype, confirming that the 
lines were, indeed, transgenic. All further experiments were 
5 done with three representative transgenic lines of the wild 

type designated as 1-3, 4D, 7-4 and one transgenic line of the 
fad2 mutant line JB12. The transgenic JB12 line was included 
in order to test whether the increased accumulation of oleic 
acid in this mutant would have an effect on the amount of 
1Q=: ricinoleic acid that accumulated in the transgenic plants. 

jri Analysis of transgenic plants 

Leaves and seeds from fahl2 transgenic Arabidopsis plants 
were analyzed for the presence of hydroxylated fatty acids 

l$i using gas chromatography. Lipids were extracted from 100-200 
mg leaf tissue or 50 seeds. Fatty acid methyl esters (FAMES) 
were prepared by placing tissue in 1.5 ml of 1.0 M methanolic 
HC1 (Supelco Co.) in a 13 x 100 mm glass screw-cap tube capped 
with a teflon-lined cap and heated to 80°C for 2 hours. Upon 

20 cooling, 1 ml petroleum ether was added and the FAMES removed 

by aspirating off the ether phase which was then dried under a 
nitrogen stream in a glass tube. One hundred /xl of 
N,0-bis(Trimethylsilyl) trif luoroacetamide (BSTFA; Pierce 
Chemical Co) and 200 /Ltl acetonitrile was added to derivatize 

25 the hydroxy 1 groups. The reaction was carried out at 70° C for 

15 min. The products were dried under nitrogen, redissolved 
in 100 jul chloroform and transferred to a gas chromatograph 
vial. Two ill of each sample were analyzed on a SP2 34 0 fused 
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silica capillary column (30 m, 0.75 mm ID, 0.20 mm film, 
Supelco) , using a Hewlett-Packard 5890 II series Gas 
Chromatograph . The samples were not split, the temperature 
program was 195 °C for 18 min, increased to 230°C at 25°C/min, 
5 held at 230°C for 5 min then down to 195°C at 25°C/min. , and 

flame ionization detectors were used. 

The chromatographic elution time of methyl esters and O- 
TMS derivatives of ricinoleic acid, lesguerolic acid and 
auricolic acid was established by GC-MS of lipid samples from 

10 seeds of L. fendleri and comparison to published chromatograms 

of fatty acids from this species (Carlson et al., 1990). A O- 
TMS-methyl-ricinoleate standard was prepared from ricinoleic 
acid obtained from Sigma Chemical Co (St, Louis, MO). 0-TMS- 
methyl-lesqueroleate and O-TMS-methyl-auricoleate standards 

15 were prepared from triacylglycerols purified from seeds of L. 

fendleri. The mass spectrum of O-TMS-methyl-ricinoleate, 0- 
7 TMS-methyl-densipoleate, O-TMS-methyl-lesqueroleate, and O- 
TMS-methyl-auricoleate are shown in Figures 1A-D, 
respectively. The structures of the characteristic ions 

20 produced during mass spectrometry of these derivatives are 

shown in Figure 2 . 

Lipid extracted from transgenic tissues were analyzed by 
gas chromatography and mass spectrometry for the presence of 
hydroxy lated fatty acids. As a matter of reference, the 

25 average fatty acid composition of leaves in Arabidopsis wild 

type and fad2 mutant lines was reported by Miguel and Browse 
(1992) . Gas chromatograms of methylated and silylated fatty 
acids from seeds of wild type and a fahl2 transgenic wild type 
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plant are shown in Figures 3A and 3B, respectively. The 
profiles are very similar except for the presence of three 
small but distinct peaks at 14.3, 15.9 and 18.9 minutes. A 
very small peak at 20.15 min was also evident. The elution 
5 time of the peaks at 14.3 and 18.9 min corresponded precisely 

to that of comparably prepared ricinoleic and lesquerolic 
standards, respectively. No significant differences were 
observed in lipid extracts from leaves or roots of the wild 
type and the fahl2 transgenic wild type lines (Table 1) . 
10 Thus, in spite of the fact that the fahi2 gene is expressed 

throughout the plant, we observed effects on fatty acid 
composition only in seed tissue. A similar observation was 
described previously for transgenic fahl2 tobacco in patent 
application No. 08/320,982. 

15 o 
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Table 1 : Fatty acid composition of lipids from transgenic and wild type Arabidopsis. The values are 
the means obtained from analysis of samples from three independent transgenic lines, or three 
independent samples of wild type and fad2 lines. 
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In order to confirm that the observed new peaks in the 
transgenic lines corresponded to derivatives of ricinoleic, 
lesquerolic, densipolic and auricolic acids, mass spectrometry 
was used. The fatty acid derivatives were resolved by gas 
5 chromatography as described above except that a 

Hewlett-Packard 5971 series mass selective detector was used 
in place of the flame ionization detector used in the previous 
experiment. The spectra of the four new peaks in Figure 3B 
(peak numbers 10, 11, 12 and 13) are shown in Figures 4A-D, 

10 ~ respectively. Comparison of the spectrum obtained for the 
standards with that obtained for the four peaks from the 
transgenic lines confirms the identity of the four new peaks. 
On the basis of the three characteristic peaks at M/Z 187, 270 
and 299, peak 10 is unambiguously identified as O-TMS- 

15 methylricinoleate. On the basis of the three characteristic 

peaks at M/Z 185, 270 and 299, peak 11 is unambiguously 
identified as O-TMS-methyldensipoleate. On the basis of the 
three characteristic peaks at M/Z 187, 298 and 327, peak 12 is 
unambiguously identified as O-TMS-methyllesqueroleate. On the 

2 0 basis of the three characteristic peaks at M/Z 185, 298 and 

327, peak 13 is unambiguously identified as O-TMS- 
methylauricoleate. These results unequivocally demonstrate 
the identity of the fah!2 cDNA as encoding a hydroxylase that 
hydroxylates both oleic acid to produce ricinoleic acid and 

2 5 also hydroxy late icosenoic acid to produce lesquerolic acid. 

These results also provide additional evidence that the 
hydroxylase can be functionally expressed in a heterologous 
plant species in such a way that the enzyme is catalytically 
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functional. These results also demonstrate that expression of 
this hydroxylase gene leads to accumulation of ricinoleic, 
lesquerolic, densipolic and auricolic acids in a plant species 
that does not normally accumulate hydroxy lated fatty acids in 
5 extractable lipids. 

The presence of lesquerolic acid in the transgenic plants 
was anticipated in the previous patent application (No. 
08/320,982) based on the biochemical evidence suggesting broad 
substrate specificity of the kappa hydroxylase. By contrast, 

10 O the accumulation of densipolic and auricolic acids was less 
|H predictable. Since Arabidopsis does not normally contain 
in significant quantities of the non-hydroxylated precursors of 
03 these fatty acids which could serve as substrates for the 
hydroxylase, it appears that one or more of the three n-3 

15 m fatty acid desaturases known in Arabidopsis (eg., f ad3 , fad7, 
fad8; reviewed in Gibson et al. , 1995) are capable of 
desaturating the hydroxylated compounds at the n-3 position. 
That is, densipolic acid is produced by the action of an n-3 
desaturase on ricinoleic acid. Auricolic acid is produced by 

20 the action of an n-3 desaturase on lesquerolic acid. Because 

it is located in the endoplasmic reticulum, the fad3 
desaturase is almost certainly responsible. This can be 
tested in the future by producing f ahl2 -containing transgenic 
plants of the f ad3-def icient mutant of Arabidopsis (similar 

25 experiments can be done with fad7 and fad8) . It is also 

formally possible that the enzymes that normally elongate 
18:1 cisA9 tQ 20:l cisAl1 may elongate 120H-18 : l CiS * ? to l4OH-2 0:l CisA1 \ 

and 120H-18:2 c1sA9 ' 15 to 14OH-20 : 2 cis * 11 ' 17 . 
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The amount of the various fatty acids in seed, leaf and 
root lipids of the control and transgenic plants is presented 
in Table 1. Although the amount of hydroxylated fatty acids 
produced in this example is less than desired for commercial 
production of ricinoleate and other hydroxylated fatty acids 
from plants, we envision numerous improvements of this 
invention that will increase the level of accumulation of 
hydroxylated fatty acids in plants that express the fahl2 or 
related hydroxylase genes. Improvements in the level and 
tissue specificity of expression of the hydroxylase gene is 
envisioned. Methods to accomplish this by the use of strong, 
seed-specific promoters such as the B . napus nap in promoter or 
the native promoters of the castor fahl2 gene or the 
corresponding hydroxylase gene from L. fendleri will be 
obvious to one skilled in the art. Additional improvements 
are envisioned to involve modification of the enzymes which 
cleave hydroxylated fatty acids from phosphatidylcholine, 
reduction in the activities of enzymes which degrade 
hydroxylated fatty acids and replacement of acyltransf erases 
which transfer hydroxylated fatty acids to the sn-1, sn-2 and 
sn-3 positions of glycerolipids . Although genes for these 
enzymes have not been described in the scientific literature, 
their utility in improving the level of production of 
hydroxylated fatty acids can be readily envisioned based on 
the results of biochemical investigations of ricinoleate 
synthesis. 

Although Arabidopsis is not an economically important 
plant species, it is widely accepted by plant biologists as a 



model for higher plants. Therefore, the inclusion of this 
example is intended to demonstrate the general utility of the 
invention described here and in the previous application (No. 
08/320,982) to the modification of oil composition in higher 
plants. One advantage of studying the expression of this 
novel gene in Arabidopsis is the existence in this system of a 
large body of knowledge on lipid metabolism, as well as the 
availability of a collection of mutants which can be used to 
provide useful information on the biochemistry of fatty acid 
hydroxylation in plant species. Another advantage is the ease 
of transposing any of the information obtained on metabolism 
of ricinoleate in Arabidopsis to closely related species such 
as the crop plants Brassica napus, Brassica juncea or Crambe 
Abyssinica in order to mass produce ricinoleate, lesqueroleate 
or other hydroxylated fatty acids for industrial use. The 
kappa hydroxylase is useful for the production of ricinoleate 
or lesqueroleate in any plant species that accumulates 
significant levels of the precursors, oleic acid and icosenoic 
acid. Of particular interest are genetically modified 
varieties that accumulate high levels of oleic acid. Such 
varieties are currently available for sunflower and Canola. 
Production of lesquerolic acid and related hydroxy fatty acids 
can be achieved in species that accumulate high levels of 
icosenoic acid or other long chain monoenoic acids. Such 
plants may in the future be produced by genetic engineering of 
plants that do not normally make such precursors. Thus, we 
envision that the use of the kappa hydroxylase is of general 
utility. 




1 

». 

EXAMPLE 2. ISO LATION OF LESOUERELLA KAPPA HYDROXYLASE GENOMIC 
CLONE 

Overview 

Regions of nucleotide sequence that were conserved in both the 
Castor kappa hydroxylase and the Arabidopsis fad2 A 12 fatty 
acid desaturase were used to design oligonucleotide primers. 
These were used with genomic DNA from Lesquerella fendleri to 
amplify fragments of several homologous genes. These 
amplified fragments were then used as hybridization probes to 
identify full length genomic clones from a genomic library of 
L. fendleri. Hydroxylated fatty acids are specific to the 
seed tissue of iesguerella sp. , and are not found to any 
appreciable extent in vegetative tissues. One of the two 
genes identified by this method was expressed in both leaves 
and developing seeds and is therefore thought to correspond to 
the Al2 fatty acid desaturase. The other gene was expressed 
at high levels in developing seeds but was not expressed or 
was expressed at very low levels in leaves and is the kappa 
hydroxylase from this species. The identity of this gene will 
be established by introducing the gene into transgenic 
Arabidopsis plants and showing that it causes the accumulation 
of ricinoleic acid, lesquerolic acid, densipolic acid and 
auricolic acid in seed lipids. The promoter of this gene is 
also of utility because it is able to direct expression of a 
gene specifically in developing seeds at a time when storage 
lipids are accumulating. This promoter is, therefore, of 
great utility for many applications in the genetic engineering 
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of seeds, particularly in members of the Brass icacea. 

The various steps involved in this process are described 
in detail below. Unless otherwise indicated, routine methods 
for manipulating nucleic acids, bacteria and phage were as 
described by Sambrook et al. (1989) . 

Isolat ion of a fragment of the Lescruerella kappa hydroxylase 
gene 

Oligonucleotide primers for the amplification of the L. 
fendleri kappa hydroxylase were designed by choosing regions 
of high deduced amino acid sequence homology between the 
Castor kappa hydroxylase and the Arabidopsis A 12 desaturase 
(fad2) . Because most amino acids are encoded several 
different codons, these oligonucleotides were designed to 
encode all possible codons that could encode the corresponding 
amino acids. The sequence of these mixed oligonucleotides was: 
Oligo 1: TAYWSNCAYMGNMGNCAYCA (SEQ ID NO: 14) 
Oligo 2: RTGRTGNGCNACRTGNGTRTC (SEQ ID NO: 15) 

(Where: Y = C+T; W = A+T; S = G+C; N = A+G+C+T; M = A+C; R = 
A+G) 

These oligonucleotides were used to amplify a fragment of 
DNA from L. fendleri genomic DNA by the polymerase chain 
reaction (PCR) using the following conditions: Approximately 
100 ng of genomic DNA was added to a solution containing 25 
pmol of each primer, 1.5 U Taq polymerase (Boehringer 
Manheim) , 200 uM of dNTPs, 50 mM KCl, 10 mM Tris.Cl (pH 9) , 
0.1% (v/v) Triton X-100, 1 . 5 mM MgCl 2 , 3% (v/v) fonrtamide, to a 
final volume of 50 /xl. Amplifications conditions were: 4 min 



denaturation step at 94 °C, followed by 30 cycles of 92 °C for 1 
min, 55 °C for l min, 72 °C for 2 min. A final extension step 
closed the program at 72 °C for 5 min. 

PCR products of approximately 540 bp were observed 
following electrophoretic separation of the products of the 
PCR reaction in agarose gels. Two of these fragments were 
cloned into pBluescript (Stratagene) to give rise to plasmids 
pLesq2 and pLesq3 . The sequence of the inserts in these two 
plasmids was determined by the chain termination method. The 
sequence of the insert in pLesq2 is presented as Figure 5 (SEQ 
ID NO:l) and the sequence of the insert in pLesq3 is presented 
as Figure 6 (SEQ ID NO: 2). The high degree of sequence 
identity between the two clones indicated that they were both 
potential candidates to be either a A 12 desaturase or a gama 
hydroxylase. 

Northern analysis 

In L. fendleri, hydroxylated fatty acids are found in 
large amounts in seed oils but are not found in appreciable 
amounts in leaves. Therefore, an important criterion in 
discriminating between a fatty acyl desaturase and kappa 
hydroxylase is that the kappa hydroxylase gene is expected to 
be expressed more highly in tissues which have high level of 
hydroxylated fatty acids than in other tissues whereas all 
plant tissues should contain mRNA for an <o6 fatty acyl 
desaturase since diunsaturated fatty acids are found in the 
lipids of all tissues in most or all plants. Therefore, it 
was of great interest to determine whether the gene 




corresponding to pLesq2 was also expressed only in seeds, or 
is also expressed in other tissues. This question was 
addressed by testing for hybridization of pLesq2 to RNA < 
purified from developing seeds and from leaves. 
5 Total RNA was purified from developing seeds and young 

leaves of L. fendlerl using an Rneasy RNA extraction kit 
(Qiagen) , according to the manufacturer's instructions. RNA 
concentrations were quantified by UV spectrophotometry at 
A.=260 and 280 nm. In order to ensure even loading of the gel 
10 p to be used for Northern blotting, RNA concentrations were 

further adjusted after recording fluorescence under UV light 
IV of RNA samples stained with ethidium bromide and run on a test 
^ denaturing gel. 

i=* Total RNA prepared as described above from leaves and 

15:^; developing seeds was electrophoresed through an agarose gel 
JIT containing formaldehyde (Iba et al. , 1993). An equal quantity 
^ (10 pnq) of RNA was loaded in both lanes, and RNA standards 

(0.16-1.77 kb ladder, Gibco-BRL) were loaded in a third lane. 
Following electrophoresis, RNA was transferred from the gel to 
20 a nylon membrane (Hybond N+, Amersham) and fixed to the filter 

by exposure to UV light. A 32 P- label led probe was prepared 
from insert DNA of clone pLesq2 by random priming and 
hybridized to the membrane overnight at 52°C, after it had been 
prehybridized for 2 h. The prehybridization solution 
25 contained 5X SSC, 10X Denhardt's solution, 0.1% SDS, 0.1M KP0 4 

pH 6.8, 100 /<g/ml salmon sperm DNA. The hybridization 
solution had the same basic composition, but no SDS, and it 
contained 10% dextran sulfate and 3 0% formamide. The blot was 
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washed once in 2X SSC, 0.5% SDS at 65°C then in IX SSC at the 
same temperature. 

Brief (30 min) exposure of the blot to X-ray film 
revealed that the probe pLesq2 hybridized to a single band 
only in the seed RNA lane (Figure 7) . The blot was re-probed 
with the insert from pLesq3 gene, which gave bands of similar 
intensity in the seed and leaf lanes (Figure 7} . 

These results show that the gene corresponding to the 
clone pLesq2 is highly and specifically expressed in seed of 
L. fendleri. In conjunction with knowledge of the nucleotide 
and deduced amino acid sequence, strong seed-specific 
expression of the gene corresponding to the insert in pLesq2 
is a convincing indicator of the role of the enzyme in 
synthesis of hydroxylated fatty acids in the seed oil. 

Characterization of a genomic clone of the gamma hydroxylase 

Genomic DNA was prepared from young leaves of L. fendleri 
as described by Murray and Thompson (1980) . A Sau3Al -partial 
digest genomic library constructed in the vector ADashll 
(Stratagene, 11011 North Torrey Pines Road, La Jolla CA 92037) 
was prepared by partially digesting 500 fjq of DNA, size- 
selecting the DNA on a sucrose gradient (Sambrook et al. , 
1989) , and ligating the DNA (12 kb average size) to the BamHT- 
digested arms of ADashll. The entire ligation was packaged 
according to the manufacturer's conditions and plated on E. 
coli strain XLl-Blue MRA-P2 (Stratagene) . This yielded 5xl0 5 
primary recombinant clones. The library was then amplified 
according to the manufacturer's conditions. A fraction of the 




genomic library was plated on E. coli XLl-Blue and resulting 
plaques (150,000) were lifted to charged nylon membranes 
(Hybond N+, Amersham) , according to the manufacturer's 
conditions. DNA was crosslinked to the filters under UV in a 
5 Stratalinker (Stratagene) . 

Several clones carrying genomic sequences corresponding 
to the L. fendleri hydroxylase were isolated by probing the 
membranes with the insert from pLesq2 that was PCR-amplif ied 
with internal primers and labelled with 32 P by random priming. 
10.£: The filters were prehybridized for 2 hours at 65°C in 7% SDS, 
m lmM EDTA, 0.25 M Na 2 HP0 4 (pH 7.2), 1% BSA and hybridized to 
m the probe for 16 hours in the same solution. The filters were 
m sequentially washed at 65°C in solutions containing 2 X SSC, 1 
q X SSC, 0.5 X SSC in addition to 0.1 % SDS. A 2.6 kb Xba I 
15j=y fragment containing the complete coding sequence for the 
q gamma-hydroxylase and approximately 1 kb of the 5* upstream 
region was subcloned into the corresponding site of 
pBluescript KS to produce plasmid pLesq-Hyd and the sequence 
determined completely using an automatic sequencer by the 
2 0 dideoxy chain termination method. Sequence data was analyzed 

using the program DNASIS (Hitachi company) . 

The sequence of the insert in clone pLesq-Hyd is shown in 
Figures 8A-B. The sequence entails 1855 bp of contiguous DNA 
sequence (SEQ ID NO: 3). The clone encodes a 401 bp 5 1 
25 untranslated region (i.e., nucleotides preceding the first ATG 

codon) , an 1152 bp open reading frame, and a 302 bp 3' 
untranslated region. The open reading frame encodes a 384 
amino acid protein with a predicted molecular weight of 44,370 



(SEQ ID NO: 4). The amino terminus lacks features of a typical 
signal peptide (von Heijne, 1985). 

The exact translation-initiation methionine has not been 
experimentally determined, but on the basis of deduced amino 
acid sequence homology to the Castor kappa hydroxylase (noted 
below) is thought to be the methionine encoded by the first 
ATG codon at nucleotide 402. 

Comparison of the pLesq-Hyd deduced amino acid sequence 
with sequences of membrane-bound desaturases and the castor 
hydroxylase (Figures 9A-B) indicates that pLesq-Hyd is 
homologous to these genes. This figure shows an alignment of 
the L. fendleri hydroxylase (SEQ ID NO: 4) with the castor 
hydroxylase (van de Loo et al. 1995), the Arabidopsis fad2 
cDNA which encodes an endoplasmic reticulum-localized Al2 
desaturase (called fad2) (Okuley et al., 1994), two soybean 
fad2 desaturase clones, a Brassica napus fad2 clone, a Zea 
mays fad2 clone and partial sequence of a R. communis fad2 
clone. The high degree of sequence homology indicates that 
the gene products are of similar function. For instance, the 
overall homology between the Lesquerella hydroxylase and the 
Arabidopsis fad2 desaturase was 92.2% similarity and 84.8% 
identity and the two sequences differed in length by only one 
amino acid . 

Southern hybridization 

Southern analysis was used to examine the copy number of 
the genes in the L. fendleri genome corresponding to the clone 



pLesq-Hyd. Genomic DNA (5 peg) was digested with EcoR I, Hind 
III and Xba I and separated on a 0.9% agarose gel. DNA was 
alkali-blotted to a charged nylon membrane (Hybond N+, 
Amersham) , according to the manufacturer's protocol. The blot 
was prehybridized for 2 hours at 65°C in 7% SDS, ImM EDTA, 0.25 
M Na 2 HP0 4 (pH 7.2), 1% BSA and hybridized to the probe for 16 
hours in the same solution with pLesq-Hyd insert PCR-amplif ied 
with internal primers and labelled with 32 P by random priming. 
The filters were sequentially washed at 65°C in solutions 
containing 2 X SSC, 1 X SSC, 0.5 X SSC in addition to 0.1 % 
SDS, then exposed to X-ray film. 

The probe hybridized with a single band in each digest of 
L. fendleri DNA (Figure 10) , indicating that the gene from 
which pLesq-Hyd was transcribed is present in a single copy in 
the L. fendleri genome. 

Expression of pLesq-Hyd in Transgenic Plants 

There are a wide variety of plant promoter sequences 
which may be used to cause tissue-specific expression of 
cloned genes in transgenic plants. For instance, the napin 
promoter and the acyl carrier protein promoters have 
previously been used in the modification of seed oil 
composition by expression of an antisense form of a desaturase 
(Knutson et al. 1992) . Similarly, the promoter for the 0- 
subunit of soybean /?-conglycinin has been shown to be highly 
active and to result in tissue-specific expression in 
transgenic plants of species other than soybean (Bray et al., 
1987) . Thus, although we describe the use of the L. fendleri 



kappa hydroxylase promoter in the examples described here, 
other promoters which lead to seed-specific expression may 
also be employed for the production of modified seed oil 
composition. Such modifications of the invention described 
here will be obvious to one skilled in the art. 

Constructs for expression of L. fendleri kappa 
hydroxylase in plant cells are prepared as follows: A 13 kb 
Sail fragment containing the pLesq-Hyg gene was ligated into 
the Xhol site of binary Ti plasmid vector pSLJ44026 (Jones et 
al., 1992) (Figure 11) to produce plasmid pTi-Hyd and 
transformed into Agrobacterium tumefaciens strains GV3101 by 
electroporation . Strain GV3101 (Koncz and Schell, 1986) 
contains a disarmed Ti plasmid. Cells for electroporation 
were prepared as follows. GV3101 was grown in LB medium with 
reduced NaCl (5 g/1). A 250 ml culture was grown to OD 600 = 
0.6, then centrifuged at 4000 rpm (Sorvall GS-A rotor) for 15 
min. The supernatant was aspirated immediately from the loose 
pellet, which was gently resuspended in 500 ml ice-cold water. 
The cells were centrifuged as before, resuspended in 30 ml 
ice-cold water, transferred to a 30 ml tube and centrifuged at 
5000 rpm (Sorvall SS-34 rotor) for 5 min. This was repeated 
three times, resuspending the cells consecutively in 30 ml 
ice-cold water, 3 0 ml ice-cold 10% glycerol, and finally in 
0.75 ml ice-cold 10% glycerol. These cells were aliguoted, 
frozen in liquid nitrogen, and stored at -80C. 
Electroporations employed a Biorad Gene pulsar instrument 
using cold 2 mm-gap cuvettes containing 40 pel cells and 1 fxl 
of DNA in water, at a voltage of 2.5 KV, and 200 Ohms 




resistance. The electroporated cells were diluted with 1 ml 
SOC medium (Sambrook et al. , 1989, page A2) and incubated at 
28°C for 2-4 h before plating on medium containing kanamycin 
(50 mg/1) . 

5 Arabidopsis thaliana can be transformed with the 

Agrobacterlum cells containing pTi-Hyd as described in Example 
1 above. Similarly, the presence of hydroxylated fatty acids 
in the transgeneic Arabidopsis plants can be demonstrated by 
the methods described in Example 1 above. 

io;3 

32 Constitutive expression of the L. fendleri hydroxylase in 
transgenic plants 

jjf A 1.5 kb EcoR I fragment from pLesq-Hyg comprising the 

entire coding region of the hydroxylase was gel purified, then 
15;;! cloned into the corresponding site of pBluescript KS 

ji: (Stratagene) . Plasmid DNA from a number of recombinant clones 
was then restricted with Pst I, which should cut only once in 
the insert and once in the vector poly linker sequence. 
Release of a 920 bp fragment with Pst I indicated the right 
20 orientation of the insert for further manipulations. DNA from 

one such clone was further restricted with Sail, the 5« 
overhangs filled-in with the Klenow fragment of DNA polymerase 
I, then cut with Sac I. The insert fragment was gel purified, 
and cloned between the Sma I and Sac I sites of pBI121 
25 (Clontech) behind the Cauliflower Mosaic Virus 35S promoter. 

After checking that the sequence of the junction between 
insert and vector DNA was appropriate, plasmid DNA from a 
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recombinant clone was used to transform A. tumefaciens 
(GV3101) . Kanamycin resistant colonies were then used for in 
planta transformation of A. thaliana as previously described. 

DNA was extracted from kanamycin resistant seedlings and 
used to PCR-amplify selected fragments from the hydroxylase 
using nested primers. When fragments of the expected size 
could be amplified, corresponding plants were grown in the 
greenhouse or on agar plates, and fatty acids extracted from 
fully expanded leaves, roots and dry seeds. GC-MS analysis 
was then performed as previously described to characterize the 
different fatty acid species and detect accumulation of 
hydroxy fatty acids in transgenic tissues. 

EXAMPLE 3 - OBTAINING OTHER PLANT FATTY AC YL HYDROXYLASES 

In a previous patent application, we described the ways 
in which the castor fahl2 sequence could be used to identify 
other kappa hydroxylases by methods such as PCR and 
heterologous hybridization. However, because of the high 
degree of sequence similarity between Al2 desaturases and 
kappa hydroxylases, prior art does not teach how to 
distinguish between the two kinds of enzymes without a 
functional test such as demonstrating activity in transgenic 
plants or another suitable host (e.g., transgenic microbial ox 
animal cells). The identification of the L. fendleri 
hydroxylase provided for the development of criteria by which 
a hydroxylase and a desaturase may be distinguished solely on 
the basis of deduced amino acid sequence information. 

Figures 9A-B show a sequence alignment of the castor and 
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L. fendleri hydroxylase sequences with the castor hydroxylase 
sequence and all publically available sequences for all plant 
microsomal A 12 fatty acid desaturases. Of the 384 amino acid 
residues in the castor hydroxylase sequence, more than 95% are 
5 identical to the corresponding residue in at least one of the 

desaturase sequences. Therefore, none of these residues are 
responsible for the catalytic differences between the 
hydroxylase and the desaturases. Of the remaining 16 residues 
in the castor hydroxylase and 14 residues in the Lesquerella 
10 f 2 hydroxylase, all but six represent instances where the 

o;l hydroxylase sequence has a conservative substitution compared 

with one or more of the desaturase sequences, or there is wide 
ffi variability in the amino acid at that position in the various 
desaturases. By conservative, we mean that the following 
15 P amino acids are functionally equivalent : Ser/Thr, lie/ Leu/ 
Val/ Met, Asp/ Glu. Thus, these structural differences also 
cannot account for the catalytic differences between the 
desaturases and hydroxylases. This leaves just six amino acid 
residues where both the castor hydroxylase and the Lesquerella 
20 hydroxylase differ from all of the known desaturases and where 

all of the known microsomal A 12 desaturases have the identical 
amino acid residue. These residues occur at positions 69, 
111, 155, 226, 304 and 331 of the alignment in Figure 9. 
Therefore, these six sites distinguish hydroxylases from 
25 desaturases. Based on this analysis, we claim that any enzyme 

with greater than 60% sequence identity to one of the enzymes 
listed in Figure 9 can be classified as a hydroxylase if it 
differs from the sequence of the desaturases at these six 
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positions. Because of slight differences in the number of 
residues in a particular protein, the numbering may vary from 
protein to protein but the intent of the number system will be 
evident if the protein in question is aligned with the castor 
hydroxylase using the numbering system shown herein. Thus, in 
conjunction with the methods for using the lesquerella 
hydroxylase gene to isolate homologous genes, the structural 
criterion disclosed here teaches how to isolate and identify 
plant kappa hydroxylase genes for the purpose of genetically 
modifying fatty acid composition as disclosed herein and in 
the previous application (No. 08/320,982). 

In considering which of the six substitutions are solely 
or primarily responsible for the difference in catalytic 
activity of the hydroxylases of this invention and the 
desaturases, we consider it likely that the substitution of a 
Phe for a Tyr at position 226 may be solely responsible for 
this difference in catalytic activity because of the known 
participation of tyrosine radicals in enzyme catalysis. Other 
substitutions, such as the Ala for Ser at position 331 may 
have effects at modulating the overall rate of the reaction. 
On this basis we envision creating novel kappa hydroxylases by 
site directed mutagenesis of A 12 desaturases. We also 
envision converting Al5 desaturases and A9 desaturases to 
hydroxylases by similar use of site-directed mutagenesis. 

CONCLUDING REMARKS 

By the above examples, demonstration of critical factors 
in the production of novel hydroxylated fatty acids by 



expression of a kappa hydroxylase gene from Castor in 
transgenic plants is described. In addition, a complete cDNA 
sequence of the Lesquerella fendleri kappa hydroxylase is also 
provided. A full sequence of the castor hydroxylase is also 
given with various constructs for use in host cells. Through 
this invention, one can obtain the amino acid and nucleic acid 
sequences which encode plant fatty acyl hydroxylases from a 
variety of sources and for a variety of applications. 

All publications mentioned in this specification are 
indicative of the level of skill of those skilled in the art 
to which this invention pertains. All publications are herein 
incorporated by reference to the same extent as if each 
individual publication was specifically and individually 
indicated to be incorporated by reference. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes of 
clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope of 
the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Somerville, Chris 

Broun, Pierre 

(ii) TITLE OF INVENTION: Production of Hydroxylated Fatty Acids in 

Genetically Modified Plants 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CUSHMAN DARBY & CUSHMAN, LLP 

(B) STREET: 1100 NEW YORK AVENUE, N.W. 

(C) CITY: WASHINGTON 

(D) STATE: D.C. 

(E) COUNTRY: USA 

i=l (F) ZIP: 20005-3918 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.44 KB storage 

\% (B) COMPUTER: IBM PC 

ZL (C) OPERATING SYSTEM: DOS 5.0 

(D) SOFTWARE: Word Perfect 5.1 

(vi) CURRENT APPLICATION DATA; 

;== (A) APPLICATION NUMBER: unassigned 

S (B) FILING DATE: 

=SJ (C) CLASSIFICATION: 

U (vii) ATTORNEY / AGENT INFORMATION: 

H (A) NAME: Paul N. Kokulis 

H= (B) REGISTRATION NUMBER: 16,773 

(C) REFERENCE /DOCKET NUMBER: 1220/213781 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 861 3000 

(B) TELEFAX: 202 822 0944 

(2) INFORMATION FOR SEQ ID NO:l 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TATTGGCACC GGCGGCACCA TTCCAACAAT GGATCCCTAG AAAAAGATGA AGTCTTTGTC 60 
CCACCTAAGA AAGCTGCAGT CANATGGTAT GTCAAATACC TCAACAACCC TCTTGGACGC 120 



- 65 - 



ATT CTGGTGT TAACAGTTCA GTTTATCCTC GGGTGGCCTT TGTATCTAGC CTTTAATGTA 180 

TCAGGTAGAC CTTATGATGG TTTCGCTTCA CATTTCTTCC CTCATGCACC TATCTTTAAG 240 

GACCGTGAAC GTCTCCAGAT ATACATCTCA GATGCTGGTA TTCTAGCTGT CTGTTATGGT 3 00 

CTTTACCGTT ACGCTGCTTC ACAAGGATTG ACTGCTATGA TCTGCGTCTA CGGAGTACCG 3 60 

CTTTTGATAG TGAACTTTTT CCTTGTCTTG GTCACTTTCT TGCAGCACAC TCATCCTTCA 420 

TTACCTCACT ATGATTCAAC CGAGTGGGAA TGGATTAGAG GAGCTTTGGT TACGGTAGAC 480 

AGAGACTATG GAATCTTGAA CAAGGTGTTT CACAACATAA CAGACACCCA CGTAGCACAC 540 
CAC 



543 



if2) INFORMATION FOR SEQ ID NO: 2 

'M (i) SEQUENCE CHARACTERISTICS: 

]fi (A) LENGTH: 544 nucleotides 

It! (B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

yJ (ii) MOLECULE TYPE: cDNA 

S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

^TATAGGCACC GGAGGCACCA TTCCAACACA GGATCCCTCG AAAGAGATGA AGTATTTGTC 60 

ScAAAGCAGA AATCCGCAAT CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 12 0 

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT TGTACTTAGC CTTCAACGTT 180 

TCTGGCAGAC CCTACAATGG TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 240 

GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA TTCTAGCCGT CTGTTATGGT 3 00 

CTTTACCGTT ACGCTGTTGC ACAAGGACTA GCCTCAATGA TCTGTCTAAA CGGAGTTCCG 3 60 

CTT CTGATAG TTAACTTTTT CCTCGTCTTG ATCACTTACT TACAACACAC TCACCCTGCG 420 

TTGCCTCACT ATGATTCATC AGAGTGGGAT TGG CTT AG AG GAGCTTTAGC TACTGTAGAC 480 

AGAGACTATG GAATCTTGAA CAAGGTGTTC CATAACATCA CAGACACCCA CGTCGCACAC 540 
CACT 



544 



(2) INFORMATION FOR SEQ ID NO: 3 
(i) SEQUENCE CHARACTERISTICS: 
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(A) 
(B) 
(C) 
(D) 



LENGTH: 
TYPE: 

STRANDEDNESS : 
TOPOLOGY : 



1740 nucleotides 

nucleotide 

single 

linear 



(ii) MOLECULE TYPE: 



genomic 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGAAGCTTT ATAAGAAGTT AGTTTTCTCT GGTGACAGAG AAATTNTGTC AATTGGTAGT 

GACAGTTGAA GCAACAGGAA CAACAAGGAT GGTTGGTGNT GATGCTGATG TGGTGATGTG 

TTATTCATCA AATACTAAAT ACTACATTAC TTGTTGCTGC CTACTTCTCC TATTTCCTCC 

GCCACCCATT TTGGACCCAC GANCCTTCCA TTTAAACCCT CTCTCGTGCT ATTCACCAGA 

SGAGAAGCCA AGAGAGAGAG AGAGAGAATG TTCTGAGGAT CATTGTCTTC TTCATCGTTA 

jjTAACGTAAG TTTTTTTTGA CCACT CAT AT CTAAAATCTA GTACATGCAA TAGATTAATG 

JicTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA GATGGGTGCT GGTGGAAGAA 

jlAATGGTTAC CCCCTCTTCC AAGAAATCAG AAACTGAAGC CCTAAAACGT GGACCATGTG 

=AGAAACCACC ATTCACTGTT AAAGATCTGA AGAAAGCAAT CCCACAGCAT TGTTTCAAGC 

I'GCTCTATCCC TCGTTCTTTC TCCTACCTTC TCACAGATAT CACTTTAGTT TCTTGCTTCT 

hACTACGTTGC CACAAATTAC TTCTCTCTTC TTCCTCAGCC TCTCTCTACT TACCTAGCTT 

I-GGCCTCTCTA TTGGGTATGT CAAGGCTGTG TCTTAACCGG TATCTGGGTC ATTGGCCATG 

AATGTGGTCA CCATGCATTC AGTGACTATC AATGGGTAGA TGACACTGTT GGTTTTATCT 

TCCATTCCTT CCTTCTCGTC CCTTACTTCT CCTGGAAATA CAGTCATCGT CGTCACCATT 

CCAACAATGG ATCTCTCGAG AAAGATGAAG TCTTTGTCCC ACCGAAGAAA GCTGCAGTCA 

AATGGTATGT TAAATACCTC AACAACCCTC TTGGACGCAT TCTGGTGTTA ACAGTTCAGT 

TTATCCTCGG GTGGCCTTTG TATCTAGCCT TTAATGTATC AGGTAGACCT TATGATGGTT 

TCGCTTCACA TTTCTTCCCT CATGCACCTA T CTTTAAAG A CCGAGAACGC CTCCAGATAT 

ACAT CTCAGA TGCTGGTATT CTAGCTGTCT GTTATGGTCT TTACCGTTAC GCTGCTTCAC 

AAGGATTGAC TGCTATGATC TGCGTCTATG GAGTAC CGCT TTTGATAGTG AACTTTTTCC 

TTGTCTTGGT AACTTTCTTG CAGCACACTC ATCCTTCGTT ACCTCATTAT GATTCAACCG 

AGTGGGAATG GATTAGAGGA GCTTTGGTTA CGGTAGACAG AGACTATGGA ATATTGAACA 

AGGTGTTCCA TAACATAACA GACACACATG TGGCTCATCA TCTCTTTGCA ACTATACCGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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ATTATAACGC AATGGAAGCT ACAGAGGCGA TAAAGCCAAT ACTTGGTGAT TACTACCACT 1440 

TCGATGGAAC ACCGTGGTAT GTGGCCATGT ATAGGGAAGC AAAGGAGTGT CTCTATGTAG 1500 

AACCGGATAC GGAACGTGGG AAGAAAGGTG TCTACTATTA CAACAATAAG TTATGAGGCT 1560 

GATAGGGCGA GAGAAGTGCA ATTATCAATC TTCATTTCCA TGTTTTAGGT GTCTTGTTTA 1620 

AGAAGCTATG CTTTGTTTCA ATAATCTCAG AGTCCATNTA GTTGTGTTCT GGTGCATTTT 1680 

GCCTAGTTAT GTGGTGTCGG AAGTTAGTGT TCAAACTGCT TCCTGCTGTG CTGCCCAGTG 1740 

AAGAACAAGT TTACGTGTTT AAAATACTCG GAACGAATTG ACCACAANAT ATCCAAAACC 1800 

GGCTATCCGA ATTCCATATC CGAAAACCGG ATATCCAAAT TTCCAGAGTA CTTAG 1855 

Mf2) INFORMATION FOR SEQ ID NO: 4 

M (i) SEQUENCE CHARACTERISTICS: 

ffl (A) LENGTH: 384 amino acids 

II! (B) TYPE: amino acid 

M (C) STRANDEDNESS : 

ffl (D) TOPOLOGY: linear 

T (ii) MOLECULE TYPE: protein 

it (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

jT Met Gly Ala Gly Gly Arg lie Met Val 

20 

Glu Thr Glu Ala Leu Lys Arg Gly Pro 

40 

Val Lys Asp Leu Lys Lys Ala lie Pro 
50 

lie Pro Arg Ser Phe Ser Tyr Leu Leu 

70 

Cys Phe Tyr Tyr Val Ala Thr Asn Tyr 

Leu Ser Thr Tyr Leu Ala Trp Pro Leu 
100 

Val Leu Thr Gly lie Trp Val lie Gly 

120 

Phe Ser Asp Tyr Gin Trp Val Asp Asp 



10 

Thr Pro Ser Ser Lys Lys Ser 

30 

Cys Glu Lys Pro Pro Phe Thr 



Gin His Cys Phe Lys Arg Ser 
60 

Thr Asp lie Thr Leu Val Ser 

80 

Phe Ser Leu Leu Pro Gin Pro 
90 

Tyr Trp Val Cys Gin Gly Cys 

110 

His Glu Cys Gly His His Ala 



Thr Val Gly Phe lie Phe His 



- 68 - 



130 140 
Ser Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg 

150 160 
His His Ser Asn Asn Gly Ser Leu Glu Lys Asp Glu Val Phe Val Pro 

170 

Pro Lys Lys Ala Ala Val Lys Trp Tyr Val Lys Tyr Leu Asn Asn Pro 

180 190 
Leu Gly Arg lie Leu Val Leu Thr Val Gin Phe lie Leu Gly Trp Pro 

200 

Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala 

210 220 
Ser His Phe Phe Pro His Ala Pro lie Phe Lys Asp Arg Glu Arg Leu 

230 240 
Gin lie Tyr lie Ser Asp Ala Gly lie Leu Ala Val Cys Tyr Gly Leu 

250 

Tyr Arg Tyr Ala Ala Ser Gin Gly Leu Thr Ala Met lie Cys Val Tyr 

260 270 
Gly Val Pro Leu Leu lie Val Asn Phe Phe Leu Val Leu Val Thr Phe 

280 

Leu Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Thr Glu Trp 

290 300 
Glu Trp lie Arg Gly Ala Leu Val Thr Val Asp Arg Asp Tyr Gly lie 

310 320 
Leu Asn Lys Val Phe His Asn lie Thr Asp Thr His Val Ala His His 

330 

Leu Phe Ala Thr lie Pro His Tyr Asn Ala Met Glu Ala Thr Glu Ala 

340 350 
lie Lys Pro lie Leu Gly Asp Tyr Tyr His Phe Asp Gly Thr Pro Trp 

360 

Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys Leu Tyr Val Glu Pro 

370 380 
Asp Thr Glu Arg Gly Lys Lys Gly Val Tyr Tyr Tyr Asn Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid 
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( C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

10 

Met Gly Gly Gly Gly Arg Met Ser Thr Val lie Thr Ser Asn Asn Ser 

20 30 
Glu Lys Lys Gly Gly Ser Ser His Leu Lys Arg Ala Pro His Thr Lys 

40 

Pro Pro Phe Thr Leu Gly Asp Leu Lys Arg Ala lie Pro Pro His Cys 

50 60 
jPhe Glu Arg Ser Phe Val Arg Ser Phe Ser Tyr Val Ala Tyr Asp Val 

i 70 80 

icys Leu Ser Phe Leu Phe Tyr Ser lie Ala Thr Asn Phe Phe Pro Tyr 

90 

ille Ser Ser Pro Leu Ser Tyr Val Ala Trp Leu Val Tyr Trp Leu Phe 

100 110 
■j Gin Gly Cys He Leu Thr Gly Leu Trp Val lie Gly His Glu Cys Gly 

120 

J! His His Ala Phe Ser Glu Tyr Gin Leu Ala Asp Asp He Val Gly Leu 

130 140 
lie Val His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser 

150 160 
His Arg Arg His His Ser Asn He Gly Ser Leu Glu Arg Asp Glu Val 

170 

Phe Val Pro Lys Ser Lys Ser Lys lie Ser Trp Tyr Ser Lys Tyr Ser 

180 19° 
Asn Asn Pro Pro Gly Arg Val Leu Thr Leu Ala Ala Thr Leu Leu Leu 

200 

Gly Trp Pro Leu Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 

210 220 
Arg Phe Ala Cys His Tyr Asp Pro Tyr Gly Pro lie Phe Ser Glu Arg 

230 240 
Glu Arg Leu Gin He Tyr He Ala Asp Leu Gly He Phe Ala Thr Thr 

250 

Phe Val Leu Tyr Gin Ala Thr Met Ala Lys Gly Leu Ala Trp Val Met 
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260 270 
Arg lie Tyr Gly Val Pro Leu Leu lie Val Asn Cys Phe Leu Val Met 

280 

lie Thr Tyr Leu Gin His Thr His Pro Ala lie Pro Arg Tyr Gly Ser 

290 300 
Ser Glu Trp Asp Trp Leu Arg Gly Ala Met Val Thr Val Asp Arg Asp 

310 320 
Tyr Gly Val Leu Asn Lys Val Phe His Asn He Ala Asp Thr His Val 

330 

Ala His His Leu Phe Ala Thr Val Pro His Tyr His Ala Met Glu Ala 

340 350 
Thr Lys Ala He Lys Pro He Met Gly Glu Tyr Tyr Arg Tyr Asp Gly 

f 360 

?Thr Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Lys Glu Cys Leu Phe 

370 380 
'val Glu Pro Asp Glu Gly Ala Pro Thr Gin Gly Val Phe Trp Tyr Arg 

* Asn Lys Tyr 



^ ) INFORMATION FOR SEQ ID NO: 6 
° (i) SEQUENCE CHARACTERISTICS: 

H ( A ) LENGTH: 383 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Gly Ala Gly Gly Arg Met Pro 
20 

Glu Thr Asp Thr Thr Lys Arg Val 

40 

Val Gly Asp Leu Lys Lys Ala He 
50 

He Pro Arg Ser Phe Ser Tyr Leu 



10 

Val Pro Thr Ser Ser Lys Lys Ser 

30 

Pro Cys Glu Lys Pro Pro Phe Ser 

Pro Pro His Cys Phe Lys Arg Ser 
60 

He Ser Asp He He He Ala Ser 
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70 80 



Cys Phe Tyr Tyr Val Ala Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 

90 

Leu Ser Tyr Leu Ala Trp Pro Leu Tyr Trp Ala Cys Gin Gly Cys Val 

100 110 
Leu Thr Gly lie Trp Val lie Ala His Glu Cys Gly His His Ala Phe 



120 



Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu He Phe His Ser 

130 140 
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 

150 160 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 

170 

Gin Lys Ser Ala He Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu 

180 190 
Gly Arg He Met Met Leu Thr Val Gin Phe Val Leu Gly Trp Pro Leu 

200 

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Phe Ala Cys 
210 220 



His Phe Phe Pro Asn Ala Pro He Tyr Asn Asp Arg Glu Arg Leu Gin 

230 240 
He Tyr Leu Ser Asp Ala Gly He Leu Ala Val Cys Phe Gly Leu Tyr 



250 

Arg Tyr Ala Ala Ala Gin Gly Met Ala Ser Met He Cys Leu Tyr Gly 

260 270 
Val Pro Leu Leu He Val Asn Ala Phe Leu Val Leu He Thr Tyr Leu 

280 

Gin His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp Asp 

290 300 
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly He Leu 

310 320 
Asn Lys Val Phe His Asn He Thr Asp Thr His Val Ala His His Leu 

330 

Phe Ser Thr Met Pro His Tyr Asn Ala Met Glu Ala Thr Lys Ala He 

340 350 
Lys Pro He Leu Gly Asp Tyr Tyr Gin Phe Asp Gly Thr Pro Trp Tyr 
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360 

Val Ala Met Tyr Arg Glu Ala Lys Glu Cys lie Tyr Val Glu Pro Asp 

370 380 
Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
O(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

m io 

;jj Met Gly Ala Gly Gly Arg Met Gin Val Ser Pro Pro Ser Lys Lys Ser 

S 20 30 

S Glu Thr Asp Asn lie Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr 

h 40 

m Val Gly Glu Leu Lys Lys Ala lie Pro Pro His Cys Phe Lys Arg Ser 

Ll 50 60 

fi He Pro Arg Ser Phe Ser His Leu He Trp Asp He He He Ala Ser 



70 80 
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro Asn Pro 

90 

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gin Gly Cys Val 

100 110 
Leu Thr Gly Val Trp Val He Ala His Glu Cys Gly His Ala Ala Phe 

120 

Ser Asp Tyr Gin Trp Leu Asp Asp Thr Val Gly Leu He Phe His Ser 

130 140 
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 

150 160 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Arg 

170 

Arg Ser Gin Thr Ser Ser Gly Thr Ala Ser Thr Ser Thr Thr Phe Gly 
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180 

Arg Thr Val Met 

Leu Ala Phe Asn 
210 

His Phe His Pro 

lie Tyr lie Ser 

Pro Tyr Ala Ala 
260 

Val Pro Leu Leu 

gGln His Thr His 
il 290 

:.fTrp Leu Arg Gly 
jijAsn Gin Gly Phe 

5phe Ser Thr Met 

il 340 
;-Lys Pro lie Leu 

Lys Ala Met Trp 
370 

Arg Gin Gly Glu 



Leu Thr Val Gin 
200 

Val Ser Gly Arg 

Asn Ala Pro lie 

230 

Asp Ala Gly lie 

Val Gin Gly Val 

lie Val Asn Gly 
280 

Pro Ser Leu Pro 

Ala Leu Ala Thr 
310 

His Asn lie Thr 

Pro His Tyr His 

Gly Glu Tyr Tyr 
360 

Arg Glu Ala Lys 
Lys Lys Gly Val 



Phe Thr Leu Gly 

Pro Tyr Asp Gly 
220 

Tyr Asn Asp Arg 

Leu Ala Val Cys 
250 

Ala Ser Met Val 

Phe Leu Val Leu 

His Tyr Asp Ser 
300 

Val Asp Arg Asp 

Asp Thr His Glu 
330 

Ala Met Glu Ala 
Gin Phe Asp Gly 

Glu Cys lie Tyr 
380 

Phe Trp Tyr Asn 



190 

Trp Pro Leu Tyr 

Gly Phe Ala Cys 

Glu Arg Leu Gin 
240 

Tyr Gly Leu Leu 

Cys Phe Leu Arg 
270 

lie Thr Tyr Leu 

Ser Glu Trp Asp 

Tyr Gly lie Leu 
320 

Ala His His Leu 

Thr Lys Ala lie 
350 

Thr Pro Val Val 
Val Glu Pro Asp 
Asn Lys Leu Xaa 



(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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10 

Ser Leu Leu Thr Ser Phe Ser Tyr Val Val Tyr Asp Leu Ser Phe Ala 

20 30 
Phe lie Phe Tyr lie Ala Thr Thr Tyr Phe His Leu Leu Pro Gin Pro 

40 

Phe Ser Leu lie Ala Trp Pro lie Tyr Trp Val Leu Gin Gly Cys Leu 

50 60 
Leu Thr Arg Val Cys Gly His His Ala Phe Ser Lys Tyr Gin Trp Val 

70 8° 
Asp Asp Val Val Gly Leu Thr Leu His Ser Thr Leu Leu Val Pro Tyr 

90 

Phe Ser Trp Lys lie Ser His Arg Arg His His Ser Asn Thr Gly Ser 

100 HO 
Leu Asp Arg Asp Glu Arg Val Lys Val Ala Trp Phe Ser Lys Tyr Leu 

120 

Asn Asn Pro Leu Gly Arg Ala Val Ser Leu Leu Val Thr Leu Thr lie 

130 140 
Gly Trp Pro Met Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp 

150 I 60 
Ser Phe Ala Ser His Tyr His Pro Tyr Arg Val Arg Leu Leu lie Tyr 

170 

Val Ser Asp Val Ala Leu Phe Ser Val Thr Tyr Ser Leu Tyr Arg Val 

180 190 
Ala Thr Leu Lys Gly Leu Val Trp Leu Leu Cys Val Tyr Gly Val Pro 

200 

Leu Leu lie Val Asn Gly Phe Leu Val Thr lie Thr Tyr Leu Arg Val 

210 220 
His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Lys Gly Ala Leu Ala Thr 

230 240 
Met Asp Arg Asp Tyr Gly lie Leu Asn Lys Val Phe His His lie Thr 

250 

Asp Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His 

260 270 
Leu Arg Val Lys Pro lie Leu Gly Glu Tyr Tyr Gin Phe Asp Asp Thr 

280 

Pro Phe Tyr Lys Ala Leu Trp Arg Glu Ala Arg Glu Cys Leu Tyr Val 
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290 300 
Glu Pro Asp Glu Gly Thr Ser Glu Lys Gly Val Tyr Trp Tyr Arg Asn 

Lys Tyr Leu Arg Val 



(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

H io 

Kphe Ser Tyr Val Val Tyr Asp Leu Thr lie Ala Phe Cys Leu Tyr Tyr 



20 30 
itVal Ala Thr His Tyr Phe His Leu Leu Pro Gly Pro Leu Ser Phe Arg 

40 

JfGly Met Ala lie Tyr Trp Ala Val Gin Gly Cys lie Leu Thr Gly Val 

!!J 50 60 

M^Trp Val Val Ala Phe Ser Asp Tyr Gin Leu Leu Asp Asp lie Val Gly 

M 70 80 

Leu lie Leu His Ser Ala Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr 

90 

Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu 

100 110 
Val Phe Val Pro Lys Val Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg 

120 

Val Leu Thr Leu Ala Val Thr Leu Thr Leu Gly Trp Pro Leu Tyr Leu 

130 140 
Ala Leu Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr 

150 160 
Asp Pro Tyr Gly Pro lie Tyr Ser Val lie Ser Asp Ala Gly Val Leu 

170 

Ala Val Val Tyr Gly Leu Phe Arg Leu Ala Met Ala Lys Gly Leu Ala 

180 190 
Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu Val Val Asn Gly Phe 
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200 

Leu Val Leu lie Thr Phe Leu Gin His Thr His Val Ser Glu Trp Asp 

210 220 
Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie Leu 

230 240 
Asn Lys Val Phe His Asn lie Thr Asp Thr His Val Ala His His Leu 

250 

Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Val Glu Tyr 

260 270 
Tyr Arg Phe Asp Glu Thr Pro Phe Val Lys Ala Met Trp Arg Glu Ala 

280 

Arg Glu Cys lie Tyr Val Glu Pro Asp Gin Ser Thr Glu Ser Lys Gly 

290 300 
: Val Phe Trp Tyr Asn Asn Lys Leu Ala Met Glu Ala Thr Val 



!) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

10 

Met Gly Ala Gly Gly Arg Met Thr Glu Lys Glu Arg Glu Lys Gin Glu 

20 30 
Gin Leu Ala Arg Ala Thr Gly Gly Ala Ala Met Gin Arg Ser Pro Val 

40 

Glu Lys Pro Pro Phe Thr Leu Gly Gin lie Lys Lys Ala lie Pro Pro 

50 60 
His Cys Phe Glu Arg Ser Val Leu Lys Ser Phe Ser Tyr Val Val His 

70 80 
Asp Leu Val lie Ala Ala Ala Leu Leu Tyr Phe Ala Leu Ala lie lie 

90 

Pro Ala Leu Pro Ser Pro Leu Arg Tyr Ala Ala Trp Pro Leu Tyr Trp 

100 no 
lie Ala Gin Gly Ala Phe Ser Asp Tyr Ser Leu Leu Asp Asp Val Val 
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120 

Gly Leu Val Leu His Ser Ser Leu Met Val Pro Tyr Phe Ser Trp Lys 

130 140 
Tyr Ser His Arg Arg His His Ser Asn Thr Gly Ser Leu Glu Arg Asp 

150 160 
Glu Val Phe Val Pro Lys Lys Lys Glu Ala Leu Pro Trp Tyr Thr Pro 

170 

Tyr Val Tyr Asn Asn Pro Val Gly Arg Val Val His lie Val Val Gin 

180 190 
Leu Thr Leu Gly Trp Pro Leu Tyr Leu Ala Thr Asn Ala Ser Gly Arg 

200 

Pro Tyr Pro Arg Phe Ala Cys His Phe Asp Pro Tyr Gly Pro lie Tyr 

210 220 
Asn Asp Arg Glu Arg Ala Gin lie Phe Val Ser Asp Ala Gly Val Val 

230 240 
Ala Val Ala Phe Gly Leu Tyr Lys Leu Ala Ala Ala Phe Gly Val Trp 

250 

Trp Val Val Arg Val Tyr Ala Val Pro Leu Leu lie Val Asn Ala Trp 

260 270 
Leu Val Leu lie Thr Tyr Leu Gin His Thr His Pro Ser Leu Pro His 

280 

Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly Ala Leu Ala Thr Met 

290 300 
Asp Arg Asp Tyr Gly lie Leu Asn Arg Val Phe His Asn lie Thr Asp 

310 320 
Thr His Val Ala His His Leu Phe Ser Thr Met Pro His Tyr His Ala 

330 

Met Glu Ala Thr Lys Ala lie Arg Pro lie Leu Gly Asp Tyr Tyr His 

340 350 
Phe Asp Pro Thr Pro Val Ala Lys Ala Thr Trp Arg Glu Ala Gly Glu 

360 

Cys lie Tyr Val Glu Pro Glu Asp Arg Lys Gly Val Phe Trp Tyr Asn 
370 

Lys Lys Phe Xaa 



(2) INFORMATION FOR SEQ ID NO: 11 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

10 

Trp Val Met Ala His Asp Cys Gly His His Ala Phe Ser Asp Tyr Gin 

20 30 
Leu Leu Asp Asp Val Val Gly Leu lie Leu His Ser Cys Leu Leu Val 

40 

Pro Tyr Phe Ser Trp Lys His Ser His Arg Arg His His Ser Asn Thr 

50 60 
Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys Lys Lys Ser Ser 

70 80 
E Ile Arg Trp Tyr Ser Lys Tyr Leu Asn Asn Pro Pro Gly Arg lie Met 

90 

Thr lie Ala Val Thr Leu Ser Leu Gly Trp Pro Leu Tyr Leu Ala Phe 

100 110 
Asn Val Ser Gly Arg Pro Tyr Asp Arg Phe Ala Cys His Tyr Asp Pro 

120 

„-Tyr Gly Pro lie Tyr Asn Asp Arg Glu Arg lie Glu lie Phe lie Ser 

130 140 
Asp Ala Gly Val Leu Ala Val Thr Phe Gly Leu Tyr Gin Leu Ala lie 

150 160 
Ala Lys Gly Leu Ala Trp Val Val Cys Val Tyr Gly Val Pro Leu Leu 

170 

Val Val Asn Ser Phe Leu Val Leu lie Thr Phe Leu Gin His Thr His 

180 190 
Pro Ala Leu Pro His Tyr Asp Ser Ser Glu Trp Asp Trp Leu Arg Gly 

200 

Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly lie Leu Asn Lys Val Phe 

210 220 
His Asn lie Thr Asp Thr Gin Val Ala His His Leu Phe Thr Met Pro 
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(2) 



INFORMATION FOR SEQ ID NO: 12 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 
(xi) SEQUENCE DESCRIPTION: 
GCTCTTTTGT GCGCTCATTC 



20 nucleotides 
nucleotide 
single 
linear 

DNA 

SEQ ID NO: 12: 



(2) INFORMATION FOR SEQ ID NO: 13 

qi) SEQUENCE CHARACTERISTICS: 

fl (A) LENGTH: 20 nucleotides 

ffl (B) TYPE: nucleotide 

ffl (C) STRANDEDNESS: single 

IH (D) TOPOLOGY: linear 

ylii) MOLECULE TYPE: DNA 

=:(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGSTACCAGA AAACGCCTTG 



(2J INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TAYWSNCAYM GNMGNCAYCA 



(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: 
(xi) SEQUENCE DESCRIPTION: 
RTGRTGNGCN ACRTGNGTRT C 



DNA 

SEQ ID NO: 15 
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